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LIGANDS FOR I7L AS MODULATORS OF ORTHOPOX VIRUSES AND 
METHODS FOR DISCOVERY THEREOF 

RELATED APPLICATIONS 
This patent application claims priority to provisional patent application Serial 
5 No. 60/529,3 84, filed December 12, 2003. The disclosure of provisional patent 

application Serial No. 60/529,384 is hereby incorporated by reference in its entirety. 

FIELD OF INVENTION 
The present invention relates to methods of discovery. The methods may be 
advantageous for discovering compounds that alter a biological activity of a molecule 
10 of interest. The present invention also provides anti-viral compounds that may be 
identified using such methods. 

BACKGROUND 

Viruses are obligatory intracellular parasites that can take over host cell 
transcription and translation to produce new viral particles. Interception of viral- 

1 5 driven transcription or translation, including both pre- and post-translation events, 
may result in crippling of the vims. 

Smallpox, a member of the ortiiopox family of viruses, has recently 
resurfaced as a public heath concern. Until the last several years, the production of 
vaccines and therapeutics to combat smallpox was not considered necessary, as the 

20 last known case of smallpox was reported in 1977 in Somalia. In feet, universal 
vaccination in the United States was discontinued in 1972, since the risk of 
complications firom the vaccine was actually greater than die risk of being infected 
with the disease. As a result, a portion of the population has never been vaccinated 
and thus, may be susceptible to infection by newly enierging strains of smallpox and 

25 other orthopox viruses. 

Small molecule chemotherapy may be an alternative to vaccination for the 
prevention and/or treatment of orthopox vimses. For example, since the discovery of 
non-nucleoside reverse transcriptase inhibitors (NNRTls), and protease inhibitors 
(Pis), several classes of organic molecules have been designed to combat viral 

30 infections by inhibiting targets responsible for viral replication and morphogenesis. 
Due to the highly homologous nature of the orthopox family, therapeutics developed 
against smallpox may also be potential candidate therapies for related viruses such as 
monkeypox, a virus that recently reemerged m AMca and spread to the United States 
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by importation of exotic animals, and muUuscipox virus, a common cutaneous 
infection that may be problematic ia immunocompromised individuals. 

To date, no small molecule antiviral drug has proven to be effective in the 
treatment of smallpox. The only antiviral agent currently approved for use against 
5 orthopoxviruses is cidofovir, a DNA polymerase inhibitor that may be used to treat 
cytomegalovirus and other herpes viruses. However, the usefulness of cidofovir may 
be limited in that the drug exhibits low bio-availabilily when administered orally, and 
thus, must be administered intravenously (Cundy, K.C., 1999, Clin, Pharmacokinet., 
36:127-143). 

10 Because proteolysis catalyzed by viral-encoded proteases can be a necessary 

step in the development cycle of most viruses, protease inhibitors may provide 
another class of drugs that act as anti-viral agents. For example, protease inhibitors 
have proven to be effective against human immunodeficiency virus (HIV), influenza, 
hepatitis C, and rhinovirus enzymes. 
' 1 5 During replication of vaccinia virus (W), a prototypic member of the 

orthopox family, two types of proteolytic processing occur: formative and 
morphogenic. I7L is a protease involved in the maturation of the core protein of the 
orthopoxvirus virion. I7L appears to be involved in an obligatory morphogenic 
cleavage of three major structural proteins found in the mature W virion: 4a, 4b, and 

20 25KL I7L protease is a 47 kDa cysteine protease that contains putative catalytic 

histidine and cysteine residues embedded in a conserved region containing an aspa3rtic 
acid residue. The gene for I7L appears to be highly conserved among poxvimses, as 
the identity among I7L genes between variola virus and vaccinia virus is 99%, and 
I7L genes from aU orthopox vimses also appear to possess a large degree of 

25 homology. The importance of I7L has been underscored in studies with temperature- 
sensitive (ts) viruses in which the I7L gene ha£; been shown to be essential for viral 
replication using a conditional lethal mutant, ts\6y that maps to this locus (Byrd, 
CM., et aL, Virology, 2003, 77:1 1279-1 1283; Byrd, CM., et al.. Virology, 2002, 
76:8973-8976). 

30 The resurgence of smallpox virus, and the threat of the use of smallpox virus 

as a weapon of biological warfare, has resulted in smallpox and other orthopox 
viruses reemerging as important public health concerns. The identification of agents 
that can either treat the symptoms caused by orthopoxviruses, or halt the spread of 
these viruses is of paramount importance. Thus, what is needed are methods for tiie 
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development of agents that may be used to target orthopoxvirases, such as smallpox. 
Such methods should allow for the rapid evaluation of large numbers of compounds 
such that the most effective compounds can be rapidly identified. In addition, such 
methods may provide a library of putative anti-viral agents. Such anti-viral agents 
5 may reduce or remove the threat of the virus as a weapon, and may act as a strong 
' deterrent to those attempting to develop pox viruses as biological weapons. 

. SUMMARY 

The present invention relates to methods of discovery that may be embodied in 
a variety of ways. In an embodiment, the methods are useful for discovering 

1 0 compounds that alter a biological activity of a compound of interest. The present also 
relates to these types of compound. 

In one embodiment, the invention may comprise a method for identifying a 
compoimd having the ability to modulate vims propagation in a host cell. The virus 
may comprise an orthopox vims, such as smallpox vims, vaccinia vims, monkeypox 

15 virus, mulluscipox virus, or cowpox virus. The method may comprise a first step of 
generating a three-dimensional model of a protein, or a portion thereof, required for 
orthopox viability. Next, a three-dimensional model of a potential modulator 

^ compound of interest may be generated. Finally, the method may comprise 
determining at least one atomic interaction between the potential modulator 

20 compoimd and the protein, or a portion thereof, as defined by the three-dimensional 
models for each. 

In one embodiment, the invention may comprise a method for identifying a 
compound that has the ability to modulate orthopox vims propagation in a host cell by 
inhibiting a viral I7L protease. The method may comprise the step generating a three- 

25 dimensioned model of I7L protein, or a portion thereof. The method may further 

comprise generating a three-dimensional model of a potential modulator compound of 
interest. Next, the method may comprise determining the nature of at least one of the 
atomic interactions between the potential modulator compoimd and the I7L protein, or 
a portion thereof, as defined by the three-dimensional models for the potential 

30 modulator compound and I7L, protein or a portion thereof. 

The present invention also provides a method of generating a three- 
dimensional model of a protein, or a portion thereof The method may comprise the 
steps of providing an amino acid sequence of the protein of interest, and comparing 
the amino acid sequence of the protein of interest to the amino acid sequence of other 
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proteins for which a three-dimensional structure has been defined to identify a second 
protein having a predetermined level of sequence identity to the protein of interest- 
Once a second protein having a known three-dimensional structure has been 
identified, the method may further include the step of aligning conserved residues 
5 from the protein of interest with conserved residues from the second protein. Next, 
the sequence for the protein of interest may be threaded along the three-dimensional 
structure of the second protein, such that the position of at least two conserved 
residues from both proteins are aligned. 

The presfflttt invention also comprises a computer model for I7L protein or a 

10 portion thereof, comprising structural coordinates for a three-dimensional model for 
I7L protein, or a portion thereof, operable to be visualizable on a computer screen. 

The present invention also provides anti-viral agents, la one embodiment, the 
anti-viral agents may iiihibit poxvirus. The anti-viral agent may comprise a 
pharmacophore. For example, in one embocHment, the present invention may 

15 comprise a pharmacophore comprising at least one atom or molecular group that 
interacts with at least one atom or molecular group of I7L protein, or a portion 
thereof. Or the anti-viral agent may comprise a compoimd. For example, in one 
embodiment, the present invention may comprise a compound comprising at least one 
atom or molecular group that interacts with at least one atom or molecular group of 

20 I7L protein, or a portion thereof. In one embodiment, the compound interacts with 
I7L to modulate the activity of I7L. For example, the compound may be a compound 
identified by docking a computer representation of the compound, or a synflietic 
variant thereof, with a computer representation of a three-dimensional structure of I7L 
protein, or a portion thereof. In one embodiment, the three-dimensional stmcture of 

25 I7L, or a portion thereof, is defined, at least in part, by Table 2. In yet another 

embodiment, the present invention may comprise a pharmaceutical composition. For 
example, the present invention may comprise a pharmaceutical composition < 
comprising a compound identified by docking a computer representation of the 
compound wifli a computer representation of a structure of I7L proteini, or a portion 

30 thereof. 

The present invention also comprises a method of conducting a drug-discovery 
business. The method may comprise the step of generating a &ree-dimensional 
structural model of a target molecule of interest on a computer. Also, the method may 
comprise generating a three-dimensional structural model of a potential modulator 
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compound of the target molecule on a computer, and docking the model for the 
potential modulator compoimd with the target molecule so as to minimize the free 
energy of the interaction between the target molecule and the potential modulator. In 
this way, a modulator compound that may interact with the target may be identijBed. 
5 The method may also include ttie subsequent steps of providing a modified structure 
for the modulator compound of interest, and assessing whether the modified structure 
bas a lower free energy of interaction with the target than the original modulator 
compound. 

In another embodiment, the present invention comprises treatment of orthopox 

10 viral infections using compounds identified by the methods and systems of the present 
invention. The orthopox viruses may include smallpox virus or other orthopox virses 
such as, but not limited to, vaccinia vims, monkeypox, or cowpox. 

There may be advantages provided by certain embodiments of flie present 
invention. For example, the methods of the present invention may provide a means to 

15 identify a plurality of putative pharmacological agents based upon the known three- 
dimensional structure of a target protein. Also, the present invention may provide a 
means to modify the structure of a putative pharmacological agent in silico to 
detemiine how such changes can effect the activity of the agent Making such 
determinations in silico provides the ability to t^ipidly evaluate a large number of 

20 compounds. Also, making such determinations in silico allows for a rational 
approach to drug development, such that compounds may be systematically 
developed and their activity evaluated. 

The present invention may provide compounds that may be used as 
pharmaceuticals for treating humans and animals suffering from, or potentially 

25 exposed to, infections caused by orthopox viruses, including smallpox, monkeypox 
and cowpox viruses. The compounds of the present invention may be used in 
combination therapy with other anti-viral agents. For example, anti-viral agents of the 
present invention that are protease inhibitors may be combined with other agents that 
act by other mechanisms. Also, the compouiids of the invention may provide broad 

30 spectrum antiviral agents with a low level of toxicity and a high therapeutic index. 

Such compounds may further provide an antiviral agent that may be used against viral 
strains that are resistant to other types of antiviral agents such as agents that inhibit 
DNA replication or immunomodulators. 
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There are, of course, additional featureis of tlie invention, which will be 
described in more detail hereinafter. It is to be understood that the invention is not 
limited in its application to the specific details as set forth in the following description 
and figures. The invention is capable of other embodiments and of being practiced or 
5 carried out in various ways, 

BRIEF DESCRIPTION OF THE FIGURES 
FIG. 1 shows a superposition of vaccinia virus (W) I7L protease Avith the C- 
terminal domain of ULPl protease in accordance with an embodiment of the present 
invention. I7L is shown as a solid ribbon and ULPl is shown as a multi-lined ribbon. 
10 Hang point residues of ULPl (His514, Cys580, Trp448) and I7L (His241, Cys328, 
Trpl68) are shown. Darker shading indicates regions of the polypeptide or individual 
residues that are closer to the viewer, whereais. lighter shading indicates regions of the 
polypeptide or individual residues that are farther away. 

FIG. 2 shows a three-dimensional homology threading model of vaccinia virus 
15 (W) I7L generated using the structure of the C-terminal portion of ULPl protease in 
accordance with an example embodiment of the present invention. Darker shading 
indicates regions of the polypeptide or individual residues that are closer to the 
viewer, whereas lighter shading indicates regions of the polypeptide or individual 
residues that are farther away. 
20 FIG. 3 shows a close-up view of the I7L ligand binding site in accordance 

with an example embodiment of the present invention. Darker shading indicates 
regions of the polypeptide or individual residues that are closer to the viewer, whereas 
lighter shading indicates regions of the polypeptide or individual residues that are 
farther away. 

25 FIG. 4 shows a computed dockmg mode of a small organic molecule, TTP-A, 

on the surfEice of 17L protease in accordance with an embodiment of the present 
invention. TTP-A is shown in a meshed three-dimensional surface. Darker shading 
indicates regions of the polypeptide or individual residues that are closer to the 
viewer, whereas lighter shading indicates regions of the polypeptide or individual 

30 residues that are farther away. 

FIG. 5 shows a view of the I7L ligand binding domain in accordance with an 
example embodiment of the present invention wherein Leu324 is represented in a 
space-filling representation. Darker shading indicates regions of the polypeptide or 
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individual residues that are closer to the viewer, whereas lighter shading indicates 
regions of the poljTpeptide or individual residues that are farther away. 

FIG. 6 shows the structure of two small molecule organic compounds, TTP-A 
and TTP-B, that bind to I7L protein, or a portion thereof in silico and that have an 
5 anti-viral effect in a cell culture assay in accordance with an example embodiment of 
the present invention. 

FIG. 7 shows a method for identification of potential therapeutic compounds 
targeted to I7L using in silico screening and optionally, biological screening, in 
accordance with an embodiment of the present invention. 
10 DETAILED DESCRIPTION 

Definition 

The following definitions may be used to understand the description herein. 
Unless defined otherwise, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art. Practitioners are 
15 particularly directed to Current Protocols in Molecular Biology (Ansubel) for 
definitions and terms of the art. Abbreviations for amino acid residues are the 
standard 3-letter and/or 1-letter codes used in the art to refer to one of the 20 comanon 
L-amino acids. 

I7L is a protease involved in the maturation of the orfhopox vims virion and 
20 thus, is required for orfhopox virus viability and/or replication. Vaccinia virus I7L is 
a 423 amino acid cysteine protease that that catalyzes the cleavage of the 4a, 4b, and 
25K stmctural proteins foimd in the mature vaccinia virus (W) virion. The catalytic 
residues of wild-type I7L comprise a histidine' and a cysteine embedded in a 
conserved region of the protein that contains an aspartic acid. I7L may be derived 
25 firom a variety of sources, including orthopox virases such as vaccinia virus, cowpox, 
camelpox, variola major, variola minor, monkeypox, ectromelia, sheeppox, lumpy 
skin, Yaba-like, swinepox, rabbit fibroma, myxoma, fowlpox, canarypox, armsacta 
moorei viruses. The enzyme may be fi:om any source, whether natural, synthetic, 
semi-synthetic, or recombinant. A number of I7L proteins have been identified and 
30 cloned and these may be used in the methods of the invention. All of the I7L proteins 
characterized to date may be used in the methods of the present invention. 

An I7L protein or part thereof in the present invention may be a Avild type 
enzyme or part thereof, a mutant en2yme or part thereof, or variant or homologue of 
such an enzyme. As used herein, the term "wild type" refers to a polypeptide having 
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a primary amino acid sequence which is identical with the native enzsmie. The term 
"mutant" refers to a polypeptide having a primary amino acid sequence which differs 
from the wild type sequence by one or more amino acid additions, substitutions or 
deletions. A mutant may or may not be functional. As used herein, the term "variant" 
5 refers to a naturally occurring polypeptide which differs from a wild-type sequence. 
As used herein, when referring to a protein, the terms "portion" or "part" indicate that 
the polypeptide comprises a fraction (or fractions) of the amino acid sequence referred 
to- 

'Tolypeptide" and "protein" are used interchangeably herein to describe 

1 0 protein molecules that may comprise either partial or full-length proteins. 

As used herein, "smaU organic molecules" are molecules of molecular weight 
less than 2,000 Daltons that contain at least one carbon atom. 

The term **vectof' refers to a nucleic acid molecule that may be used to 
transport a second nucleic acid molecule into a cell. In one embodiment, the vector 

1 5 allows for replication of DNA sequences inserted into the vector. The vector may 
comprise a promoter to enhance expression of the nucleic acid molecule in at least 
some host cells. Vectors may replicate autonomously (extra chromosomal) or may 
be integrated into a host cell chromosome. In one embodiment, the vector may 
comprise an expression vector capable of producing a protein derived from at least 

20 part of a nucleic acid sequence inserted into the vector. 

As used herein, the term "interact" refers to a condition of proximity between 
a ligand or compound, or portions or fragments thereof, and a portion of a second 
molecule of interest. The interaction may be non-covalent, for example, as a result of 
hydrogen-bonding, van der Waals interactions, or electrostatic or hydrophobic 

25 intemctions, or it may be covalent. 

As used herein, the term "atomic contacts" or "atomic interaction" refers to the 
inter-atomic contact between atoms in a test compound and atoms in a second 
molecule (e.g., the protein of interest) for which a three-dimensional model is made. 
The atomic interaction is govemed by geometric and physiochemical 

30 complementarity as well as steric fit between the two molecules for which the atomic 
contacts/interaction is evaluated. Thus, an atomic interaction between a potential 
modulator compound and the I7L protein, or a portion fliereof, comprises at least one 
atomic interaction selected from the group consisting of charge, electrostatic, 
hydrogen bond, and hydrophobic. The atomic interaction may be covalent bond. For 
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example, atomic interactions between I7L ligand binding domain and small molecules 
TTP-A and TTP-B are described, at least in part, by Tables 5 and 6, respectively. 

As used herein, the term "docking" refers to a process by which a test 
compound is placed in close proximity with a second molecule (e.g., the protein of 
5 interest). Docking is also used to describe the process of finding low energy 
conformations of a test compound and a second molecule (e.g., the protein or 
polypeptide of interest, or portion thereof). Docking studies include molecular 
modeling studies aimed at finding a proper fit between a ligand and its binding «ite. 

As used herein, the term "docking mode" refers to a favorable configuration of 
10 a test compound docked (e.g., positioned) within a given site on a molecule of 
interest. 

As used herein, the term "hang point residues" refers to residues on a first 
molecule of known structure fbat are then used as anchors for the threading of a 
second molecule of imknown stmcture along the structure of the first molecule so as 

15 to determine a structure for the second molecule. For example, to determine a 

structure for I7L protein, or a portion thereof, residues Cys580, His514, and Tip448 
of a ULPl protein of known stmcture were the hang point residues that were aligned 
with Cys328, His241, and Trpl68 of the I7L to determine the stmcture of I7L. 

As used herein, the term "conserved residues" refers to amino acids that are 

20 the same among a pluraUty of proteins having Ihe same structure and/or fimction. A 
region of conserved residues may be important for protein structure or function. 
Thus, contiguous conserved residues as identified in a three-dimensional protein may 
be important for protein stmcture or function. To find conserved residues, or 
conserved regions of 3-D stmcture, a comparison of sequences for the same or similar 

25 proteins fi:om different species, or of individuals of the same species, may be made. 

As used herein, the term "homologue" means a polypeptide having a degree of 
homology with the wild-type amino acid sequence. Homology comparisons can be 
conducted by eye, or more usually, with the aid of readily available sequence 
comparison programs. These commercially available computer programs can 

30 calculate percent homology between two or feore sequences (e.g. Wilbur, W. J. and 
Lipman, D. J., 1983, Proc, Natl Acad. Set USA^ 80:726-730). For example, 
homologous sequences may be taken to include an amino acid sequences which in 
alternate embodiments are at least 75% identical, 85% identical, 90% identical, 95% 
identical, or 98% identical to each other. 
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The terms "identity" or "percent identical" refers to sequence identity between 
two amino acid sequences or between two nucleic acid sequences. Percent identity 
can be determined by aligning two sequences and refers to the number of identical 
residues (z.e., amino acid or nucleotide) at positions shared by the compared 
5 sequences. Sequence aligmnent and comparison may be conducted using the 
algorithms standard in the art (e,g. Smith and Waterman, 1981, Adv, AppL Math, 
2:482; Needleman and Wunsch, 1970, J. Mol Biol. 48:443; Pearson and Lipman, 
1988, Proc. Natl Acad. Set, USA, 85:2444) or by computerized versions of these 
algorithms (Wisconsin Genetics Software Package Release 7.0, Genetics Computer 

1 0 Group, 575 Science Drive, Madison, WI) publicly available as BLAST and FASTA. 
Also, ENTREZ, available through Hie National Institutes of Health, Bethesda MD, 
may be used for sequence comparison. In one embodiment, the percent identity of 
two sequences may be determined using GCG with a gap weight of 1, such that each 
amino acid gap is weighted as if it were a single amino acid mismatch between the 

1 5 two sequences. 

As used herein, a polypeptide or protein "domain" comprises a region along a 
polypeptide or protein that comprises an independent unit Domains may be defined 
in terms of structure, sequence and/or biological activity. In one embodiment, a 
polypeptide domain may comprise a region of a protein that folds in a manner that is 

20 substantially independent fix>m the rest of the protein. Domains may be identified 
using domain databases such as, but not limited to PFAM, PRODOM, PROSITE, 
BLOCKS, PRINTS, SBASE, ISREC PROFILES, SAMRT, and PROCLASS. 

As used herein, "ligand binding domain" (LED) refers to a domain of a protein 
responsible for binding a ligand. The term "ligand binding domain" includes 

25 homologues of a ligand binding domain or portions thereof. In this regard, deliberate 
amino acid substitutions may be made in the LBD on the basis of similarity in 
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues, as long as the binding specificity of the ligand binding domain 
is retained. For example, for I7L protein, the ligand binding domain may comprise 

30 residues 1 10-423 of vaccinia virus I7L protein. 

As used herein, the "ligand binding site" comprises residues in a protein that 
directly interact with a ligand, or residues involved in positioning the ligand in close 
proximity to those residues that directly interact with the ligand. The interaction of 
residues in the ligand binding site may be defined by the spatial proximity of the 
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residues to a ligand in the model or structure. The term "ligand binding site" includes 
homologues of a ligand binding site or portions thereof. In this regard, deliberate 
amino acid substitutions may be made in the ligand binding site on the basis of 
similarity in polarity, charge, solubUity, hydrophobicity, hydrophilicity, and/or the 
5 amphipathic nature of the residues, as long as the binding specificity of the ligand 
binding site is retained. For I7L, the ligand binding site may be defined as comprising 
those residues in Table 1 . For example, the ligand binding site may be defined as 
comprising those residues in Table 1 and any other residues that are within a 3 
angstrom radius of any one of the residues iii Table 1 . 

10 As used herein, "catalytic domain" refers to a domain of a protein responsible 

for binding a substrate or that is involved in the catals^c mechanism. The term 
"catalytic domain" includes homologues of a catalytic binding domain or portions 
thereof. In ibis regard, deliberate amino acid substitutions may be made in the 
catalytic domain on the basis of snnilarity in polarity, charge, solubility,. 

15 hydrophobicity, hydrophilicity, and/or the amphipathic nature of flie residues, as long 
as the binding specificity of the catalytic site within the catalytic domain. 

As used herein, the "catalytic site" refers to a region of the catalytic domain 
that direcfly associates with a substrate or that is involved in the catalytic mechanism. 
For example, it may be a region of I7L that is responsible for binding a substrate. 

20 With reference to the models and stmctures of the present invention, residues in a 
catalytic site may be defined by their spatial prpximity to a substrate in the model or 
structure. The term "catalytic site" includes homologues of a catalytic site, or portions 
thereof. In this regard, deliberate amino acid substitutions may be made in the 
catalytic domain on the basis of similarity in polarity, charge, solubility, 

25 hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long 
as the substrate specificity of the catalytic site is retained. For example, for I7L, 'the 
catal3^c site may be included as part of the ligand binding site to include at least 
some those residues listed in Table 1. 

As used herein, a "ligand" refers to a molecule or compound or entity that 

30 associates with a ligand binding domain, including substrates or analogues or parts 
thereof. As described herein, the term "ligand" may refer to compounds that bind to 
the protein of interest. A ligand may be a modulator. Or, a ligand may not have a 
biological effect Or, a ligand may block the binding of other ligands thereby 
inhibiting a biological effect. Ligands may include, but are not limited to, small 
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molecule inhibitors of the activity of protein. These small molecules may include 
peptides, peptidomimetics, organic compounds and the like. For proteases, ligands 
naay also include polypeptide and protein substrates. 

As used herein, a "modulator compound** refers to a molecule which changes 
5 or alters the biological activity of a molecule of interest. A modulator compound may 
increase or decrease activity, or change the physical or chemical characteristics, or 
functional or immunological properties, of the molecule of interest For I7L, a 
modulator compound may increase or decrease activity, or change the characteristics, 
or functional or immunological properties of the I7L, or a portion threof A 

10 modulator compound may include natural and/or chemically synthesized or artificial 
peptides, modified peptides (e.g., phosphopeptides), antibodies, carbohydrates, 
monosaccharides, oligosaccharides, polysaccharides, glycolipids, heterocyclic 
compoimds, nucleosides or nucleotides or parts thereof, and small organic or 
inorganic molecules. A modulator compound may be an endogenous physiological 

1 5 compound or it may be a natural or synthetic compound. Or, the modulator 

compound may be a small organic molecule. The term "modulator compoimd" also 
includes a chenGucally modified ligand or compound, and includes isomers and 
mcemic forms. 

The terms "stractural coordinates" or "atomic coordinates" as used herein 
20 refers to a set of values that define the position of one or more amino acid residues or 
molecules with reference to a system of axes. A data set of structural coordinates 
defines the three dimensional stmcture of a molecule or molecules. Stractural 
coordinates can be slightiy modified and still render nearly identical three 
dimensional stmctures. A measure of a unique set of structural coordinates is the 
25 root-mean-square deviation of the resulting structure. In alternate embodiments, 

stractural coordinates that render three dimensional stractures that deviate firom one 
another by a root-mean-square deviation of less than 3 angstroms, or less than 2.0 
angstroms, or less than 0.5 angstroms, or less than 0.3 angstroms, may be viewed by a 
person of ordinary sldll iu the art as identical. Variations in stractural coordinates 
30 may be genemted because of mathematical manipulations of the stractural coordinates 
of I7L as described hereia. For example, the stractural coordinates of Tables 2-4 may 
be manipulated by crystallographic permutations of the stractural coordinates, 
firactionalization of the stractural coordinates, integer additions or subtractions to sets 
of the stractural coordinates, inversion of the stractural coordinates or any 
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combination of the above. Variations in structure due to mutations, additions, 
substitutions, and/or deletions of tihie amino acids, or other changes in any of the 
components that make up a stmcture of the invention may also account for 
modifications in structural coordinates. If such modifications are within the standard 
5 error as compared to the original structural coordinates, the resulting structure may be 
considered to be the same or equivalent. Therefore, a ligand that bound to a ligand 
binding domain of an I7L would also be expbcted to bind to another ligand binding 
domain whose stmctural coordinates deiSned a shape that fell within the margui of 
error defined by the first structure. Such modified stractures of a ligand binding 
. 10 domain are also within the scope of the invention. For example, using the surface 
topology of a group of ligands^ such as low-energy binding modes ofTTP-A and 
TTP-B, which exhibit effector quality (agonist or antagonist) can be overlapped and 
the contours of all TTP-A and TTP-B averaged into a union sur&ce. This union 
surface of a ligand is expected to be complementary to the surface mold of the 

1 5 corresponding binding site of I7L enzyme. 

As used herein, a structural **model" of a protein of interest, a polypeptide of 
interest, or any other compound of interest, may be in two or three dimensions. For 
example, a computer model may be in three diniensions despite the constraints 
imposed by a computer screen, if it is possible to scroll along at least a pair of axes, 

20 causing rotation of the image. Also, a model of a protein or chemical compound of 
interest may be defined by the structural coordinates for the protein or compound of 
interest. 

As used herein, the terms "modeling" or "generating a model" includes the 
quantitative and qualitative analysis of molecular stmcture and/or function based on 
25 atomic stmctural information and interaction models. The term may include 

conventional numeric-based molecular dynamic and energy minimization models, 
interactive computer graphic models, modified molecular mechanics models, distance 
geometry, and other structure-based constraint models. 

The term "substrate" refers to the molecule or compound that is the target of 
30 an en^mie. For IVL, a substrate may include proteins and polypeptides cleaved by the 
I7L protease and includes the 4a, 4b, and 25K structural proteins of vaccinia virus. 

The term "peptide mimetics" are structures which serve as substitutes for 
peptides in interactions between molecules (Morgan et al., 1989, Ann, Reports Med, 
Chem., 24:243-252). Peptide mimetics may include synthetic stmctures fliat may or 
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may not contain amino acids and/or peptide bonds but that retain the structural and 
functional features of a peptide, or agonist, or antagonist Peptide mimetics also 
include peptoids, oligopeptoids (Simon et al*, ,1972, Proc, Natl Acad, ScL, USA, 
89:9367); and peptide libraries containing pe;p.t^des of a designed length representing 
5 all possible sequences of amino acids corresponding to a peptide, or agonist or 
antagonist of the invention. 

The term "treating'' refers to improving a symptom of a disease or disorder 
and may comprise curing the disorder, substantially preventing the onset of the 
disorder, or improving the subject's condition. The term "treatment as used herein, 
10 refers to the full spectrum of treatments for a given disorder from which the patient is 
suffering, including alleviation of one, most of all symptoms resulting from that 
disorder, to an outright cure for the particular disorder or prevention of the onset of 
the disorder. 

As used herein, "TC50" is the conc^tration at which 50% of the cells display 
15 signs of cj^otoxicity. Also, "IC50" is the concentration at which there is 50% 

inhibition of the measmred effect of interest For I7L, "IC50'* is the concentration at 
which there is 50% inhibition of viral cytopatibdc effect The therapeutic index, "TI," 
is a ratio of the TC50 to the IC50. Thus, clinical beneficial drugs are generally those 
that have a high TI. 

20 As used herein, "pharmacophore" is a collection of steric and elctronic 

features tiiat are necessary to ensure the optimal supramolecular interactions with a 
specific biological target structure. A pharmacophore may comprise a structural 
definition that comprises a set of active molecules. For example, using the surface 
topology of a group of ligands, such as low-energy binding modes of TTP-A and 

25 TTP-B, which exhibit effector quality (agonist or antagonist) can be overlapped and 
the contours of all TTP-A and TTP-B averaged into a union surface that comprises a 
pharmacophore. This pharmacophore is expected to be complementary to the surface 
mold of the corresponding binding site of I7L enzyme. 

As used herein, an "effective amounf ' as used herein means the amount of an 

30 agent that is effective for producing a desired effect in a subject The term 

"therapeutically effective amounf denotes that amount of a drug or pharmaceutical 
agent that will elicit the therapeutic response of an animal or human that is being 
sought. The actual dose which comprises the effective amount may depend upon the 
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route of acfaninistration, the size and health of the subject, the disorder being treated, 
and the like. 

The term ^^pharmaceutical composition" is used herein to denote a 
composition that may be administered to a mammalian host, e,g., orally, topically, 
5 parenterally, by inhalation spray, or rectally, in unit dosage formulations containing 
conventional non-toxic carriers, diluents, adjuvants, vehicles and the like. The term 
''parenteral" as usedhereio, includes subcutaneous injections, intravenous, 
intramuscular, intracistemal injection, or by infusion techniques. 

The term "a" or "an" as used herein may refer to more than one object unless 
10 the context clearly indicates otherwise. The term "or" is used interchangeably with 
the term "and/or" unless the context clearly indicates otherwise. 
Ligands for TIL as modulators of orthopox viruses 

Embodiments of the present invention provide ligands for I7L as modulators 
of viruses and methods for discovery of such ligands. In one embodiment, the 
15 invention may comprise a method for identifying a compound having flie ability to 
modulate orthopox virus propagation in a host cell. The method may comprise the 
steps of: (a) generating a three-dimensional model of a protein required for orthopox 
viability, or a portion thereof; (b) generating a three-dimensional model of a potential 
modulator compound of interest; and (c) determining at least one atomic interaction 
20 between the potential modulator compound and the protein, or a portion thereof, as 
defined by the three-dimensional models of each. 

The virus may comprise an orthopox vims, such as smallpox virms, vaccinia 

virus, monkeypox viras, muUuscipox virus, cowpox virus, camelpox virus, variola 

major vims, variola minor vims, ectromelia virus, sheeppox virus, lumpy skin virus, 

25 Yaba-like viras, swinepox viras, rabbit fibroma viras, myxoma vims, fowlpox virus, 

. 1 * 

canarypox viras, or amsacta moorei viras. In one example embodiment, the viras is 
smallpox viras. The protein may be any protein that is required for viability of the 
viras IQ a host cell. For example, the protein may be a protease that is required for 
formation or morphogenesis of the viras. Or, the protein may be required for DNA 
30 replication. The protein may be a cysteine protease. In one example embodiment, the 
protein is an I7L protease, such as vaccinia viras I7L protein. 

The method may be performed using a computer. Thus, in one embodiment, 
the method comprises the steps of: (a) generating a three-dimensional computer 
model of the protein, or a portion thereof; (b) generating a three-dimensional 
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computer model of the potential modulator compound of interest; (cl) using a 
computer to dock the three-dimensional model of the potential modulator compound 
within the model of the protein or a portion thereof; and (c2) quantifying at least one 
atomic interaction between the potential modulator compound and the protein, or a 
5 portion thereof. 

The method further allows for varying the stmcture of the potential modulator 
compound to determine how changes to the structure of the modulator may affect the 
fit of the compound with the protein of interest. Thus, the method may further 
comprise the steps of modifying Ihie computer model of the potential modulator 

10 compound, and evaluating how modifying the computer model of the potential 

modulator compound changes at least one atomic interaction between of the model of 
the potential modulator compound and the model of the protein, or portion thereof. 
The potential modulator compound may be modified in silico. Thus, in one 
embodimex^t, the step of modifying the computer model of the potential modulator 

15 compound of interest comprises the step of searching a library of molecular stmctures 
for molecular fragments that caii be linked to the potential modulator compound, 
wherein a molecular ftagment comprises at least one atom. The method may further 
comprise linking a molecular fragment to the potential modulator compound to 
generate a modified compound. The modified compound may then be evaluated by 

20 docking the modified compound to the protein of interest and quantifying at least one 
atomic interaction between the modified compound and the protein of interest. 

Also, the compound may be evaluated in a biological assay. Thus, the 
compound may be evaluated by its ability to inhibit virus growth or propagation. 
Also, the compoimd may be evaluated for cytotoxicity to uninfected cells. In one 

25 embodiment, the therapeutic index (TI), comprising the TC50 (concentration of the 
compound for which 50% of uninfected cells display signs of toxicity) divided by the 
IC50 (concentration at which the viral cytopathic effect is inhibited 50%) for the 
compound may be determined. 

It may not be required to determine the entire stmcture of the protein of 

30 interest to identify compounds that may act as modulators of the protein. For 

example, the three-dimensional model of the protein of interest may comprise only a 
portion of the protein. Thus, liie model may comprise the catalytic domain. 
Additionally or altematively, the model may comprise a ligand binding domain. 
Additionally or altematively, the model may comprise a ligand binding site. 
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Additionally or alternatively, the model may comprise the catalytic site. In some 
cases, the ligand binding site may also comprise the catalj^c site. 

It is also not necessarily required to determine how each amino acid of the 
entire structure of the protein of interest interacts with a potential modulator 
5 compound to identify compounds that may act as modulators of the protein. For 
example, the amino acid used to determine an atomic interaction between a potential 
modulator compound and the protein of interest may comprise a residue that is 
conserved in the protein of interest. Additionally, or alternatively, the amino acid 
used to determine an atomic interaction between a potential modulator compound and 

10 the protein of interest may comprise a residue that is present in, or affects the structure 
of, the catalytic domain and/or the catalj^c site. Additionally, and/or alternatively, an 
amino acid used to determine an atomic interaction between a potential modulator 
compound and flie protein of interest may comprise a residue feat is present in, or 
affects the structure of, the ligand binding domain and/or the ligand binding site. 

15 It has been shown that I7Lprotein (i.e., virion core protein proteinase) may be 

required for morphogenesis of orfeopox viruses, and that without a functional I7L , 
protein, propagation of the vims may be reduced. Thus, in one embodiment, fee 
invention may comprise a mefeod for identifying a compoimd having the ability to 
modulate orfeopox vims propagation in a host cell, where fee compound acts by 

20 inhibiting an I7L protease. The orfeopox vims may comprise smallpox virrus, 

vaccinia virus, monkeypox virus, mulluscipox yiras, cowpox vims, camelpox vims, 
variola major virus, variola noinor vims, ectromelia vims, sheeppox vims, lumpy skin 
viras, Yaba-like vims, swinepox vims, rabbit fibroma virus, myxoma vims, fowlpox 
virus, canarypox vims, or amsacta moorei vims. In one example embodiment, fee 

25 viras is smallpox virus. The mefeod may comprise fee steps of: (a) generating a 

three-dimensional model of a I7L protein, or a portion feereof; (b) generating a three- 
dimensional model of a potential modulator compound of interest; and (c) 
determining at least one atomic interaction between fee potential modulator 
compound and fee I7L protein, or a portion feereof, as defined by fee three- 

30 dimensional models of fee I7L protein, or a portion feereof, and fee potential 
modulator compound of interest. 

The model of I7L may comprise a variety of formats. In one embodiment, the 
model may comprise a three-dimensional stmctural model. Or, fee model of I7L may 
comprise stractural coordinates presented as fee position of individual atoms of fee 
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I7L protein, or a portion thereof, in space. For example, the noiodel of I7L, or a 
portion thereof, may comprise the x, y, and z atomic coordinates as dejBned in Table 
2- 

The model of I7L protein, or a portion thereof, may be derived at least in part 
5 from the structure of a protein that comprises at similar function to I7L. The method 
of generating the computer model raay comprise aligning the structure of the I7L 
protein, or a portion thereof, with a second cysteine protease. In one example 
embodiment, the second cysteine protease is ubiquitin-like protein 1 (ULPl) protease. 
The model of I7L may be derived at least in part by aligning conserved 

10 sequences from the I7L protein, or a portion thereof, and a second protein. In one 
embodiment, the amino acids used to align the structure of the W I7L protein or a 
portion thereof with ULPl comprise His241, Asp248, and Cys328 of the I7L protein 
and His 514, Cys 580 and Trp448 of ULPl. . 

The method may be perfomied using a computer. Thus, in one embodiment, 

15 the method comprises the steps of: (a) generating a three-dimensional computer 
model of the I7L protein, or a portion thereof; (h) generating a three-dimensional 
computer model of the potential modulator compound; (cl) using a computer to dock 
the three-dimensional model of the potential modulator compound with the model of 
the I7L protein, or a portion thereof; and (c2) quantifying at least one atomic 

20 interaction between the potential modulator compound and the I7L as defined by the 
docking of the model of the potential modulator compound in the computer model of 
the I7L protein, or a portion thereof. 

The method further allows for varying the structure of the potential modulator 
compound to determine how changes in the structure can affect the fit of the potential 

25 modulator compound with the protein of interest. Thus, the method may further 
comprise the steps of modifying the computer model of the potential modulator 
compound, and evaluating how modifying the computer model of the potential 
modulator compound affects the atomic interkctions between of the model of the 
potential modulator compound and the model of the I7L protein, or portion thereof. 

30 The potential modulator compound may be modified in silico. Thus, in one 

embodiment, the step of modifying the computer model of the potential modulator 
compound of interest comprises the step of searching a library of molecular structures 
for molecular fragments that can be linked to the potential modulator compound, 
wherein a molecular fragment comprises at least one atom. The method may further 
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comprise linking a molecular firagment to the potential modulator compound to 
generate a modified compoimd. The modified compound may then be evaluated by 
docking the modified compound to the I7L protein, or a portion thereof, and 
determining the atomic interactions between the modified compound and the I7L 
5 protein. 

It is not necessarily required to determine the entire structure of the protein of 
interest to identify compounds that may act as modulators of the protein. For 
example, the three-dimensional model of the protein of interest may comprise only a 
portion of the protein. Thus, the model may comprise the catals^ic domain, or a 

10 portion thereof. For example, the model may comprise the catalytic site. 

Additionally or alternatively, the model may comprise a ligand binding domain, or a 
portion thereof, such as the ligand binding site. For I7L, the ligand binding site may 
also comprise the catalytic site. 

It may not be required to determine how each amino acid of the entire 

1 5 structure of the I7L protein interacts with a potential modulator compound to identify 
compounds that may act as modulators of the I7L protein. For example, an amino 
acid used to determine the atonuc interactions between a potential modulator 
compoimd and the I7L protein may comprise a residue that is conserved in the I7L 
protein. Additionally or altematively, the amino acid used to determine an atomic 

20 interaction between a potential modulator compound and the I7L protein may 

comprise a residue that is present in, or affects the structure of, the catalj^c domain 
and/or catalytic site. Additionally, or alternatively, an amino acid used to determine 
an atomic interaction between a potential modulator compound and the I7L protein 
may comprise a residue that is present in, or affects the structure of, the ligand 

25 binding domain and/or ligand binding site. 

The residues that are used to determine the atomic interactions between a 
potential modulator compoimd and the I7L protein may comprise an amino acid that 
is active in catalysis. In one example embodiment, the amino acids used to determine 
an atomic interaction between a potential modulator compound and the I7L protease, 

30 or a portion thereof, comprises the catalytic cysteine of the I7L protein. In one 
embodiment, the atomic interactions with the catalytic cysteine may comprise a 
charge or electrostatic interaction. Additionally, or alternatively, the amino acids used 
to determine an atomic interaction between a potential modulator compoxmd and the 
I7L protein, or a portion thereof, may comprise at least one of Cys328, His241, 
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Asp248, or Asp258 of the I7L protein. Additionally, or alternatively, the amino acids 
used to determine an atomic int^action between a potential modulator compoxmd and 
the I7L protein, or a portion thereof, may comprise at least one of Leu324, Trp242, or 
Gln322 of the I7L protein. Or, the amino acids used to determine an atomic 
5 interaction between a potential modulator compound and the I7L protein, or a portion 
thereof, may comprise at least one of Gly329, Leu323, Ser240, TrpieS, Asp 194, 
Asnl71, Serl73, Ghi322, Metl95, Ser326, Glu327, Leu239, Leul77, Asnl99, 
Metl69, Plie236, Ile203, or Met233 of the I7L protein. In one embodiment, the I7L 
protein, or portion thereof, is W I7L. 

10 Depending on the source of the jnrotein used to generate a three-dimensional 

stmcture, there may be some variability in the absolute positioning of each amino 
acid. Still, it is to be expected that the relative positions of conserved amino acids 
may be maintained. For example, it has been found that there is a high degree in the 
catalytic triad sequence region (i.e., His241, Asp248, and Cys328 for W I7L) of I7L 

15 proteins isolated from various poxviruses (Byrd, CM. et al., 2004, J. Virol., 

78:12147-12156) Thus, alignment of sequences immediately surrounding arnino 
acids in the catalj^c triad may comprise 95-99 percent sequence identity and identical 
spacmg between the residues. Thus, for I7L, the amino acids used to determine the 
atomic interactions between a potential modulator compound and the I7L protein may 

20 comprise Cys(N)> wherein position N corresponds to the catalytic cysteine. In one 
embodiment, the catalytic cyeteine corresponds to Cys328 of vaccinia vims I7L. 
Additionally, the amino acids used to determine the atomic interactions between a 
potential modulator compound and the I7L protein, or a portion thereof, may 
comprise at least one of His(N-87), Asp(N-80), or Asp(N-70) of the I7L protein, 

25 wherein position N corresponds to the catalytic cysteine of the I7L. Additionally, the 
amino acids used to determine the atomic interactions between a potential modulator 
compound and the I7L protein, or a portion thereof, may comprise at least one of 
Leu(N-4), Trp(N-86), or Gln(N-6) of the I7L protein, wherein position N corresponds 
to the catalytic cysteine of the I7L. Additionally, or alternatively, the amino acids 

30 used to determine the atomic interactions between a potential modulator compound 
and the I7L protein, or a portion thereof, may comprise at least one of Gly(N+l), 
Leu(N-5), Ser(N-88), Trp(N-160), Asp(N-134), Asn(N-157), Ser(N-155), Met(N- 
133), Ser(N-2), Glu(N-l), Leu(N-89), Leu(N-151), Asn(N-129), MetCN-159), Phe(N- 
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92), ne(N-125) or Met(N-95), wherein position N corresponds to the catalytic 
cysteine of I7L. 

The analysis may further employ a modified protein. Thus, the potential 
modulator compound may be evaluated for its interaction with a modified I7L protein, 
5 or portion thereof, wherein the I7L comprises at least one of an amino acid 

substitution, an amino acid deletion, or an annino acid insertion. In one embodiment, 
the amino acids used to determine the nature of the association between a test 
compound and the I7L protein, or a portion thereof, comprise at least one of wild-type 
or altered amino acid in the I7L protein corresponding to positions 168, 169, 171, 173, 
10 177, 194, 195, 199, 203, 233, 236, 239, 240, 241, 242, 248, 258, 322, 323, 324, 326, 
327, 328, or 329 of the wild-type W I7L protein. 

The nature of the interaction between the potential modulator compound and 
the protein of interest may be defined in tenns of the atomic interaction between the 
compound and the protein of interest. In an embodiment, the atomic interaction 
1 5 between a potential modulator compound and the I7L protein, or a portion thereof, 

comprises at least one atomic interaction selected from the group consisting of charge, 
electrostatic, hydrogen bond, and hydrophobic. Alternatively, the atomic interaction 
between a potential modulator compound and the I7L protein, or portion thereof, may 
comprise at least two hydrogen bond atomic interactions, at least two hydrophobic 
20 atomic interactions, and at least one of a charge or electrostatic interaction. Or, the 
atomic interaction between a potential modulator compoimd and the I7L protein, or a 
portion thereof, may comprise at least three hydrogen bond atomic interactions, at 
least three hydrophobic atomic interactions, and at least one of a charge or 
electrostatic interaction. For example, for I7L, the atomic interactions between the 
25 modulator compound and I7L may comprise at least one of the atomic interactions 
described in Table 5. Or, the atomic interactions between the modulator compound 
and I7L may comprise at least one of the atomic interactions described in Table 6. 

Also, the compound may be evaluated in a biological assay. Thus, the 
compound may be evaluated for inhibition of the virus. Also, the compound may be 
30 evaluated for cytotoxicity on uninfected cells. In one embodiment, the therapeutic 
index (TI), comprising the TC50 for the compound divided by the IC50 for the 
compound, may be determined. 

The present invention also comprises a method of generating a three- 
dimensional model of a protein of interest, or a portion thereof. In one embodiment. 
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method may comprise the steps of: (a) providing an amino acid sequence of a protein 
of interest; (b) comparing the amino acid sequence of the protein of interest to the 
amino acid sequences of a plurality of other proteins; (c) identifying a second protein 
for which a three-dimensional structure has been defined, and that has a 
5 predetermined level of sequence identity to the protein of interest; (d) aligning 
conserved residues firom the protein of interest with conserved residues firom the 
second protein; and (e) threading the protein of interest along the three-dimensional 
structure of the second protein such that the position of at least two conserved 
residues from both proteins are aUgned. 

10 The protein aligned with the protein of interest may also comprise a protein 

having a similar sequence to the protein of interest The level of sequence identity 
may range from at least 5% sequence identity, to more than 10% sequence identity, to 
more than 20% sequence identity. Also, the protein aligned with the protein of 
interest may comprise a protein having a similar frmction as the protein of interest. 

15 In one example embodiment, the protein of interest may comprise I7L and the 

second protein comprises ubiquitin-like protein 1 (ULPl). Where the protein of 
interest is I7L, and the second protein is ULPly the anmio acids used to align the 
structure of the I7L protein with ULPl may comprise His241, Asp248, and Cys328 of 
the I7L protease, and His 514, Cys 580 and Trp448 of ULPl. 

20 The present invention may also comprise a structural model for a protein, or a 

portion of a protein, that may be manipulated using a computer. In one example 
embodiment, the present invention may comprise a computer model for I7L protein, 
or a portion thereof. The model may comprise atomic coordinates for a three- 
dimensional model for I7L, or a portion thereof, operable to be visualizable on a 

25 computer screen. 

In one embodiment, the computer model of the protein of interest may 
comprise atomic coordinates presented as the position of individual atoms of the I7L 
protein, or a portion thereof, in space. For example, the model of I7L, or a portion 
thereof, may comprise at least some of the x, y, and z coordinates as defined in Table 

30 2. 

Also, the model may comprise a three-dimensional computer model of a 
potential modulator compound docked into the I7L structure such that the atomic 
interaction between the I7L and the potential modulator compound may be quantified. 
The atomic interactions between the I7L and the potential modulator compound may 
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be defined at least in part determining atomic coordinates for the potential modulator 
compound as it interacts with the I7L protein. In one embodiment, the three 
dimensional structure of a potential modulator compound may comprise at least some 
of the atomic coordinates as defined in Table 3 or Table 4. 
S The residues that are used to determine the atomic interactions between a 

potential modulator compound and the I7L protease may comprise an amino acid that 
is active in catalysis. In one example embodiment, the amino acid used to determine 
an atomic interaction between a potential modulator compound and the I7L protease, 
or a portion thereof, comprises the catalytic cysteine of the I7L protein. In one 

10 embodiment, the atomic interactions wilh the catalytic cysteine may comprise a 
* charge or electrostatic interaction. Or, an amino acid used to determine an atomic 
interaction between a potential modulator compound and the I7L protease, or a 
portion thereof, may comprise at least one of Cys328, His241, As^248, Asp258 of the 
I7L protein. Or, an amino acid used to determine an atomic interaction between a 

15 potential modulator compound and the I7L protease, or a portion thereof, may 

comprise at least one of Leu324, Trp242, or Gln322 of the I7L protein. Additionally, 
or alternatively, the amino acids used to determine an atonuc interaction between a 
potential modulator compound and the I7L protease, or a portion thereof, may 
comprise at least one of Gly329, Leu323, Ser240, Trpl68, Aspl94, Asnl71, Serl73, 

20 Gin 322, Metl95, Ser326, Glu327, Leu239, Leul77, Asnl99, Metl69, Phe236, 

Ile203, or Met233 of the I7L protein. In one embodiment, the I7L protein, or portion 
thereof, is W I7L. 

Depending on the source of the protein used to generate a three-dimensional 
structure, there may be some variability in the absolute positioning of each amino 

25 acid. Still, it is to be expected that the relative positions of conserved amino acids 

may be maintained as there is a high degree in the catalytic triad sequence region (i.e., 
His241, Asp248, and Cys328 for W I7L) of I7L proteins isolated from various 
poxvnuses (Byrd, CM. et al., 2004, J. Virol., 78:12147-12156) Thus, alignment of 
sequences immediately surrounding amino acids in the catalytic triad may comprise 

30 95-99 percent sequence identity and identical spacing between the residues. Thus, for 
I7L, the amino acids used to determine the atomic interactions between a potential 
modulator compound and I7L protease may comprise Cys(N), wherein position N 
corresponds to the catalytic cysteine of I7L. Additionally, the amino acids used to 
determine the atomic interactions between a potential modulator compound and the 
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I7L protein, or a portion thereof, may comprise at least one of His(N-87), AspON[-80), 
Asp(N-70), of the I7L protein, wherein position N corresponds to the catalytic 
cysteine of I7L. Or, the amino acids used to determine the atomic interactions 
between a potential modulator compound and the I7L protein, or a portion thereof, 
5 may comprise at least one of Leu(N-4), Trp(N-86), or Gln(N-6) of the I7L protein, 
wherein position N corresponds to the catalytic cysteine of I7L. Additionally, the 
amino acids used to determine the atomic interactions between a potential modulator 
compound and the I7L protease, or a portion thereof, may comprise at least one of 
Gly(N+l), Leu(N-5), Ser(N-88), TrpCN^160), Asp(N-134), Asn(N-157), Ser(N-155), 
10 Met(N-133), Ser(N-2), Glu(N-l), Leu(N-89), Leu(N-151), Asn(N-129), Met(N-l 59), 
Phe(N-92), Ile(N-125) or Met(N-95), wherein position N corresponds to the catalytic 
cysteine of I7L. 

The computer model may further employ a modified protein. Thus, the 
potential modulator compound may be evaluated for its intemction with a modified 

1 5 I7L protein, or portion thereof, wherein the I7L comprises at least one of an amino 
acid substitution, an amino acid deletion, or an amino acid insertion. In one 
embodiment, the amino acids used to determine the nature of the association between 
a potential modulator compound and the I7L protein, or a portion thereof, comprise at 
least one of wild-type or altered amino acid iri the I7L protein corresponding to 

20 positions 168, 169, 171, 173, 177, 194, 195, 199, 203, 233, 236, 239, 240, 241, 242, 
248, 258, 322, 323, 324, 326, 327, 328, or 329 of the wild-type W I7L protein. 

The model may allow for the nature of the interaction between the potential 
modulator compound and the protein of interest to be defined hi terms of the atomic 
interaction between the compound and the protein of interest In an embodiment, the 

25 atomic interaction between a potential modulator compound and the I7L protein, or a 
portion thereof, comprises at least one atomic interaction selected firom the group 
consisting of charge, electrostatic, hydrogen bond, and hydrophobic. Alternatively, 
the atomic intemction between a potential modulator compound and the I7L protein, 
or portion thereof, may comprise at least two hydrogen bond atomic interactions, at 

30 least two hydrophobic atomic interactions, and at least one of a charge or electrostatic 
interaction. Or, the atomic interaction between a potential modulator compound and 
the I7L protein, or a portion thereof, may comprise at least three hydrogen bond 
atomic interactions, at least three hydrophobic atomic interactions, and at least one of 
a charge or electrostatic interaction. For example, for I7L, the atomic interactions 
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between the modulator compound and I7L may comprise at least one of the atomic 
interactions described in Table 5. Or, the atomic interactions between the modulator 
compound and I7L may comprise at least one of the atomic interactions described in 
Table 6. 

5 The model allows for varying the structure of the potential modulator 

compound to determine how changes in the structure of the modij&ed compound can 
effect the fit of the compound with the protein of interest. Thus, the model may 
further comprise a three-dimensional model of a modified compound docked with the 
I7L stmcture. The potential modulator compound may be modified in silica. Thus, in 

10 one embodiment, the step of modifying the computer model of the potential 

modulator compound of interest comprises the step of searching a library of molecular 
•structures for molecular fragments that can be linked to the potential modulator 
compound, wherein a molecular firagment comprise at least one atom, and linking the 
fi:agments to the compound. The modified compound may then be evaluated by 

15 docking the modified compound to the I7L protein, or a portion thereof, and 

determining the atomic interactions between the modified compoimd and the I7L 
protein. 

The present invention also comprises a pharmacophore having a structure 
required to modify the protein of interest. For example, the pharmacophore may 

20 comprise at least one atom or molecular group that interacts with at least one atom or 
molecular group of I7L protein, or a portion thereof. Additionally, the three 
dimensional structure pf the pharmacophore may comprise a plurality of atoms or 
molecular groups that interact with at least one atom or molecular group of a three- 
dimensional structure of I7L protein, or a portion thereof. To be active as a modulator 

25 of I7L, the pharmacophore may interact with the ligand binding domain of I7L, or a 
portion thereof, such as the ligand binding site. Additionally or altematively, the 
pharmacophore may interact with the catalytic domain, or a portion therof such as the 
catalytic site of I7L. 

The stmcture of the pharmacophore may vary with changes in the structure of 

30 flie protein of interest In one embodiment for I7L, the three-dimensional structure of 
I7L may be defined by at least some of the atomic coordinates as defined in Table 2. 
Where I7L is defined by the coordinates of Table 2, the spatial arrangement of atoms 
within the pharmacophore may comprise the atomic coordinates for at least one of the 
docking modes as defined in Table 3. In another example embodiment, the spatial 
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arrangement of atoms within the pharmacophore may comprise the atomic 
coordinates for at least one of the docking modes as defined in Table 4. 

The nature of the interaction between the pharmacoph,ore and the protein of 
interest may be defined in terms of the atomic interaction between the pharmacophore 
5 and the protein of interest In an embodiment, the atondc interaction between a 

potential modulator compoimd and the I7L protein, or a portion thereof, comprises at 
least one atomic interaction selected firom the group consisting of charge, 
electrostatic, hydrogen bond, and hydrophobic. Alternatively, the atomic interaction 
between a potential modulator compound and the I7L protein, or portion thereof, may 
10 comprise at least two hydrogen bond atomic interactions, at least two hydrophobic 

r 

atomic interactions, and at least one of a charge or electrostatic intemction. Or, the 
atomic interaction betvvreen a potential modulator compound and the I7L protein, or a 
portion thereof, may comprise at least three hydrogen bond atomic interabtions, at 
least three hydrophobic atomic interactions, and at least one of a charge or 

15 electrostatic interaction. For example, for I7L, the atomic interactions between the 
pharmacophore and I7L may comprise at least one of the atomic interactions 
described in Table 5. Or, the atomic interactions between the pharmacophore and I7L 
may comprise at least one of the atomic interactions described in Table 6. 

The pharmacophore may be defined by its ability to interact with amino acids 

20 in the protein of interest that are important for catalytic activity and/or substrate 
binding. In one embodiment for an I7L pharmacophore, the interacting atom or 
molecular group for I7L may comprise the catalytic cysteine of I7L. In one 
embodiment, the atomic interactions with the catalytic cysteine may comprise a 
charge or electrostatic interaction. Or, the interacting atom or molecular group for 

25 I7L may comprise at least one of amino acids Cys328, His241, Asp248, Asp258, of 
I7L. Or, the interacting atom or molecular group for I7L may comprise at least one of 
amino acids Leu324, Trp242, and Gln322 of I7L. Additionally, or alternatively, the 
interacting atom or molecular group of I7L rniay comprise at least one of Gly329, 
Leu323, Ser240, Trpl68, Aspl94, Asnl71, Serl73, Ghi 322, Metl95, Ser326, 

30 Glu327, Leu239, Leul77, Asnl99, Metl69, Phe236, Ile203, or Mel233 of theI7L 
protein. In an embodiment, the I7L, or a portion thereof, comprises W I7L. 

Depending on the source of the protein used to generate a three-dimensional 
structure, there may be some variability in the absolute positioning of each amino 
acid. Still, it is to be expected that the relative positions of conserved amino acids 
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may be maintained. As described above, there is a high degree in the catalytic triad 
sequence region (i.e,, His241, Asp248, and Cys328 for W I7L) of I7L proteins 
isolated from various poxviruses (Byrd, CM. et al., 2004, J. Virol., 78:12147-12156) 
Thus, alignment of sequences immediately surrounding amino acids in the catalytic 
5 triad may comprise 95-99 percent sequence identity and identical spacing between the 
residues. For I7L, the interacting group(s) used to determine the atomic interactions 
between the pharmacophore and I7L protein may comprise Cys(N), wherein position 
N corresponds to the catalytic cysteine of I7L. Additionally, the interacting group(s) 
may comprise at least one of His(N-87), Asp(N-80), Asp(N-70), of the I7L protein, 

10 wherein position N corresponds to the catalytic cysteine of I7L. Additionally, the 
interacting group(s) may comprise at least one of Leu(N-4), Trp(N-86), or Gln(N-6) 
of the I7L protein, wherein position N corresponds to the catalytic cysteine of I7L. 
Additionally, the interacting group of I7L may comprise at least one of Gly(N-M), 
Leu(N-5), Ser(N-88), TipCN-160), Asp(N-134), Asn(N-157), Ser(N-155), Met(N" 

15 133), Ser<N-2), Glu(N-l), Leu(N-89), Leu(N-151), Asn(N-129), Met(N-159), Phe(N- 
92), Ile(N-125) orMet(N-95), wherein position N corresponds to the catalytic 
cysteine of I7L. 

The computer model may further employ a modified protein. Thus, the 
pharmacophore may be evaluated for its interaction with a modified I7L protein, or 

20 portion thereof, wherein the I7L comprises at least one of an amino acid substitution, 
an amino acid deletion, or an amino acid insertion. In one embodiment, the I7L 
amino acids used to detemoine the nature of the association between the 
pharmacophore and the I7L protein, or a portion thereof, comprise at least one of 
wild-type or altered amino acid in the I7L protein corresponding to positions 168, 

25 169, 171, 173, 177, 194, 195, 199, 203, 233, 236, 239, 240, 241, 242, 248, 258, 322, 
323, 324, 326, 327, 328, or 329 of the wild-type W I7L protem. 

In yet another embodimrat, the present invention comprises compounds that 
interact with at least one atom or molecular group of the I7L protein. In an 
embodiment, such compounds bind to the catalytic domain and/or catalytic site pf 

30 I7L. In yet another embodiment, the compounds include molecules that interact with 
residues known to be in the ligand binding domain and/or ligand binding site. In yet a 
fiurttier embodiment the compound comprises TTP-A or TTP-B. 
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The interaction between the compound and I7L may comprise an in silico 
interaction defined by a computer model of the structure of the compound and a 
computer model of the I7L protein, or a portion thereof. Thus, the present invention 
may also comprise a compound identified by docking a computer representation of 
5 the compound with a computer representation of a structure of I7L, or a portion 
thereof, as defined by Table 2. Where I7L is defined by the coordinates of Table 2, 
the spatial arrangement of atoms within the compound may comprise the atomic 
coordinates for at least one of the docking modes as defined in Table 3. In another 
example embodiment, the spatial arrangement of atoms within the compound 
10 comprises the atomic coordinates for at least one of the docking modes as defined in 
Table 4. 

The nature of the interaction between the compound and the protein of interest 
may be defined in teims of the atomic interaction between the compound and the 
protein of interest. In an embodiment, the atomic interaction between a potential 

15 modulator compound and the I7L protein, or a portion thereof, comprises at least one 
atomic interaction selected fi:om the group consisting of charge, electrostatic, 
hydrogen bond, and hydrophobic. Alternatively, the atomic interaction between a 
potential modulator compound and the I7L protein, or portion thereof, may comprise 
at least two hydrogen bond atomic interactions, at least two hydrophobic atomic 

20 interactions, and at least one of a charge or electrostatic interaction. Or, the atomic 
interaction between a potential modulator coinpound and the I7L protein, or a portion 
thereof, may comprise at least three hydrogen bond atomic interactions, at least three 
hydrophobic atomic interactions, and at least one of a charge or electrostatic 
interaction. For example, for I7L, the atomic interactions between the compound and 

25 I7L may comprise at least one of the atomic interactions described in Table 5, Or, the 
atondc interactions between the compound and I7L may comprise at least one of the 
atomic interactions described in Table 6, 

The present invention also comprises pharmaceutical compositions comprising 
compounds able to modify the activity of a protein of interest. In one embodiment, 

30 the protein of interest may comprise I7L. Also, the pharmaceutical compositions may 
comprise anti-viral activity. In one embodiment, the present invention may comprise 
a pharmaceutical composition comprising a compoimd identified by docking a 
computer representation of the compound wilh a computer representation of a three- 
dimensional structure of I7L, or a portion thereof. The structure of I7L or a portion 
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thereof, may comprise at least some of the atomic coordinates as defined by Table 2, 
Also, the three dimensional structure of the compoxmd used in the pharmaceutical 
composition may comprise at least some of the atomic coordinates of at least one of 
flie docking modes as defined in Table 3. Or, the three dhnensional structure of the 
5 compound used in the pharmaceutical composition may comprise at least some of the 
atomic coordinates of at least one of the docking modes as defined in Table 4. 

The nature of the interaction between the compoimd of the pharmaceutical 
composition and the protein of interest may be defined in terms of the atomic 
interaction between the compound and the protein of interest. In an embodiment, the 

10 atomic interaction between a potential modulator compound and the I7L protein, or a 
portion thereof, comprises at least one atomic interaction selected from the group 
consisting of charge, electrostatic, hydrogen bond, and hydrophobic. Altematively, 
Hie atomic interaction between a potential modulator compound and the I7L protein, 
or portion thereof, may comprise at least two hydrogen bond atomic interactions, at 

1 5 least two hydrophobic atomic interactions, and at least one of a charge or electrostatic 
interaction. Or, the atomic interaction between a potential modulator compoimd and 
the I7L protein, or a portion hereof, may comprise at least tiiree hydrogen bond 
atomic interactions, at least three hydrophobic atomic interactions, and at least one of 
a charge or electrostatic interaction. For example, for I7L, the atomic interactions 

20 between the compound able to modify I7L and the I7L protein may comprise at least 
one of the atomic interactions described in Table 5. Or, the atomic interactions 
between the compound able to modify I7L aud.the I7L protein may comprise at least 
one of the atomic interactions described in Table 6, 

The compound may be defined by its ability to interact with amino acids in the 

25 proteinofinterestthatareimportantforcatalyticactivity and/or substrate binding. In 
one embodiment for an I7L modulating compound, the interacting atom or molecular 
group for I7L may comprise the catalytic cysteine of I7L. In one embodiment, the 
atomic interactions with the catalytic cysteine may comprise a charge or electrostatic 
interaction. Or, the interacting atom or molecular group for I7L may comprise at least 

30 one of amino acids Cys328, His241, Asp248, Asp258, of I7L. Or, the interacting 
atom or molecular group for I7L may comprise at least one of amino acids Leu324, 
Trp242, and Gln322 of I7L. Additionally, or altematively, the interacting atom or 
molecular group of I7L.may comprise at least one of Gly329, Leu323, Ser240, 
Trpl68, Aspl94, Asnl71, Serl73, Gin 322, Metl95, Ser326, Glu327, Leu239, 
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Leul77, Asnl99, Metl69, Phe236, ne203, or Mel233 of the I7L protein. In one 
embodiment, the I7L, or a portion thereof, is W I7L. 

Depending on the source of the protein used to generate a three-dimensional 
structure, there may be some variability in the absolute positioning of each amino 
5 acid. Still, due to the high homology maintained among I7L proteins from various 
sources at least in the catalytic triad region, it is to be expected that the relative 
positions of conserved amino acids may be maintained. For example, for I7L, the 
interacting group(s) used to determine the atomic interactions between the compound 
and I7L protein may comprise Cys(]SI), wherein position N corresponds to the 

10 catalytic cysteine of I7L. Additionally, the intemcting group(s) may comprise at least 
one of His(N-87), Asp(N-80), Asp(N-70), of the I7L protein, wherein position N 
corresponds to the catalytic cysteine of I7L. Additionally, the interacting group(s) 
may comprise at least one of Leu0*^-4), Trp(N-86), or Gln(N-6) of the I7L protein, 
wherein position N corresponds to the catalytic cysteine of I7L. Additionally, the 

1 5 interacting group of I7L may comprise at least pne of Gly(N+l), Leu(N-5), Ser(N- 
88), Trp(N-160), Asp(N-134), Asn(N-157), SerCN-155), Met(N-133), Ser(N-2), 
Glu(N-l), Leu(N-89), Leu(N-151), Asn(N-129), Met(N-159), Phe(N-92), ne(N-i25) 
or Met(N-95), wherein position N corresponds to the catalytic cysteine of I7L. 

The compound may also be evaluated for its interaction with a modified I7L 

20 protein, or portion thereof, wherem the I7L comprises at least one of an amino acid 
substitution, an amino acid deletion, or an amino acid insertion. In one embodiment, 
the I7L amino acids used to determine the nature of the association between the 
compound and the I7L protein, or a portion thereof, comprise at least one of wild-type 
or altered amino acid in the I7L protein corresponding to positions 168, 169, 171, 173, ' 

25 177, 194, 195, 199, 203, 233, 236, 239, 240, 241, 242, 248, 258, 322, 323, 324, 326, 
327, 328, or 329 of the wild-type W I7L protein. 

The pharmaceutical composition may comprise the compound present in a 
therapeutically effective amount. In one embodiment, a therapeutically effective 
amotmt may comprise an amount sufficient to reduce a viral load in a subject. The 

30 dosage used for the phannaceutical compositions of the present invention may vary 
depending on the specific compound being used, as well as the methods of 
administration. In one embodiment, a therapeutically effective amount may comprise 
a dose in a range firom about 0.01 to 1,000 mg of active compound per kg body 
weight per day. 
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The phannaceutical compositions and compounds of the present invention 
may be used to treat or prevent a variety of viral infections. The virus may comprise 
an orfhopox virus, such as smallpox virrus, vaccinia virus, monkeypox virus, 
muUuscipox virus, cowpox virus, camelpox virus, variola major virus, variola minor 
5 virus, ectromelia virus, sheeppox vims, lumpy skin viras, Yaba-like vims, swinepox 
virus, rabbit fibroma viras, myxoma virus, fowlpox vims, canarypox viras, or amsacta 
moorei viras. In one example embodiment, the viras is smallpox viras. Also, 
additional anti-viral agents may be employed. 

The present invention also comprises a method of conducting a drag-discovery 

10 business. The method may comprise the step of generating a three-dimensional 

stractural model of a target molecule of interest on a computer. Also, the method may 
comprise generating a three-dimensional stractural model of a potential modulator 
compound of the target molecule on a computer, and docking tiie model for the 
potential modulator compound to with the target molecule so as to minimize the free 

1 5 energy of the interaction between the target molecule and the potential modulator. Jn 
this way, a modulator compound that may interact wilh the target may be identified. 
The method may also include the subsequent steps of providing a modified stracture 
for the modulator compound of interest, and assessing whether the modified stracture 
has a lower free energy of interaction with the target than the original stracture for the 

20 modulator compound. 

The method may further include evaluating at least some of the potential 
modulator conipounds identified by in silico screening in a biological assay. Once 
compounds initially identified by the in silico assay are corroborated by a biological 
assay, animal studies may be used for detailed therapeutic profiling, and 

25 pharmaceutical compositions may then be developed. Or, additional in silico assays 
may be conducted on compounds that appear to be promising based on the biological 
data. 

In another embodiment, the present invention comprises treatment of orthopox 
viral infections using compounds identified by the methods and systems of the present 
30 invention and phamiaceutical compositions comprising such compounds. The virus 
may comprise smallpox virras, vaccinia viras, monkeypox viras, mulluscipox viras, 
cowpox viras, camelpox viras, variola major viras, variola minor viras, ectromelia 
virus, sheeppox virus, lumpy skin viras, Yaba-like viras, swinepox vniis, rabbit 
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fibroma virus, myxoma virus, fowlpox virus, canarypox virus, or amsacta moorei 
virus. In one example embodiment, the virus is smallpox virus. 

The compound may comprise a small organic compound. In one example 
embodiment, the compound may comprise TTP-A, or a salt or prodrug thereof, as 
5 defined herein. Or, the compound may comprise TTP-B, or a salt or prodrug thereof, 
as defined herein. 
Structural Modeling of I7L 

Embodiments of the present invention comprise computer modeling methods 
and systems to identify and optimize specific small molecules that bind to, and thus, 

10 are able to modulate the activity of, a particular target protein. In one embodiment, 
the protein is I7L. Also provided by the present invention are compomids identified 
using the modeling methods described herein. 

Thus, in one embodiment, the present invention provides a method of 
generating a three-dimensional model of a protein, or a portion, thereof. The method 

1 5 may comprise the steps of providing an amino acid sequence of the protein of interest 
and comparing the amino acid sequence of ihe protein of interest to the amino acid 
sequences of othdr proteins to identify a second protein for which a three-dimensional 
structure has been defined, and that has a predetermined level of sequence identity to 
the protein of interest. Once a second protein having a known structure has been 

20 identified, the method may include tihte st^ of aUgning conserved residues firom the 
protein of interest with conserved residues firom the second protein. Next, the 
sequence for the protein of interest may be threaded along the three-dimmsional 
structure of the second protein such that the {position of at least two conserved 
residues firom both proteins are aligned. The conserved residues firom the first protein 

25 and the second protein may comprise residues that are essential for protein Amotion. 

Thus, as a first step, a three-dimensional model of the protein of interest may 
be generated. To generate a three dimensional model of a protein of interest, a 
sequence comparison to proteins with experim^intally determined three-dimensional 
stmctures may be performed. The protein aligned with the protein of interest may 

30 comprise a protein having a similar sequence to the protein of interest. The level of 
sequence identity may range firom at least 5% sequence identity, to more flian 10% 
sequence identity, to more than 20% sequence identity. 

The protein aligned with the protein of interest may not necessarily be 
functionally related to tiie protein of interest. Or, the protein aligned with the protein 
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of interest may comprise a protein having a similar function to the protein of interest. 
In this way, conserved residues that have similar functions in the two proteins may be 
aligned. 

In one embodiment, the protein of interest may comprise I7L. In performing 
5 structural modeling for I7L, a high sequence identity for vaccinia virus (W) I7L is 
found with the C-terminal domain of another cysteine protease, XJbiquitin-like 
protease 1 (ULPl). ULPl protease consists of 221 amino acids, and exhibits a 22% 
sequence identity with I7L. ULPl may be used as a template for building the three 
dimensional model of I7L. 

10 To develop a three dimensioiial structure for I7L, TTPredict™ site search 

algorithms may be used to identify the ligand binding site of I7L based on the location 
of active site residues His241, Asp248, and Cys328, tliat are known to be essential for 
I7L activity. Also, TTPredict™ algorithms may be used to identify known I7L- 
homologous sequences using BLAST searches on protein sequence databases. 

1 5 TTPredict™ algorithms may also be used to access a number of publicly available 
and vendor supplied fold recognition programs to analyze I7L sequence folds (e.g., 
MSI suite of programs, TTPGene). Such sequence comparisons reveal that, as 
compared to other proteins with known 3D structures, the C-terminus domain of 
ULPl (wwPDB Protein Data Bank Archive: PDB code: lEUV) has a high structural 

20 homology to I7L sequence. The ligand binding domain of the I7L sequence (amino 
acids 1 10-423) may be mapped onto residues from the C-terminus of ULPl protease 
domain using 3DPSM and the Homology modeling suites within the Accelrys suite of 
programs (San Diego, CA). Despite having only a 22% sequence identity with I7L, 
the 3D structure of ULPl may be used as a threading template to generate a 3D model 

25 for the I7L query sequence. 

The threading approach may reveal distantly homologous proteins that share 
the same folding structure, but that do not comprise a high amount of sequence 
similarity. Rather than relying only on sequence aligmnent, the fold recognition 
method nmy blend the sequence-to-structure fitness with other structural 

30 characteristics, such as sequence similarity and predicted secondary structures, to find 
conserved residues that appear in both the template protein of interest (e.g., I7L) as 
well as any query sequences, and overlay bofli sequences, maintaining aligmnent of 
the conserved residues. Next, the threading program may match the query sequence 
on the tiiree-dimensional structure of the template using conserved residues of the 
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query protein as the hang points. The resulting model may then be cleaned-up using 
standard energy miiiimization and molecular djoiamics techniques. In one 
embodiment, the conserved residues used as hang points may need to be determined a 
priori. FIG. 1 shows the results of the analysis for I7L and ULPl, wherein the hang 
5 point residues for ULPl (His514, Cys580, and Trp448) are aligned with analogous 
and conserved I7L residues (His241, Cys328, Trpl68) to generate a three-dimensional 
structure for I7L. , 

The present invention also comprises a computer-generated molecular model 
for I7L. For example^ FIG. 2 shows a ribbon representation of I7L ligand binding 

10 domain based on aligmnent of the I7L protein sequence with ULPl to generate a 
three-dimensional structure for the ligand binding domain of I7L. The model may 
comprise the catalytic site required for I7L-mediated cleavage of substrate proteins. 
The model may fiulher include the ligand binding site for antiviral small molecule 
Ugands. The predicted active site residues for I7L may include those residues that 

15 form the catalytic site, or residues that form the ligand binding site, or residues that 
participate in neighboring interactions required to maintain the structure of the ligand 
binding domain and/or the chemical functions required for the catalytic site. In FIG. 
2, amino acid residues that comprise at least a part of the substrate binding pocket are 
labeled. Also, the predicted ligand binding site is shown within the oval shaped area. 

20 The model may be further refined once the initial stmctural coordinates are 

defined. Thus, specific aspects of the model, such as the catalytic site and/or a ligand 
binding site, may be refined to incorporate the structures of substrates or ligands that 
may be bound at that site. I7L has two domains, a cyteine protease domain and a 
DNA regulatory domain. In the present invention, the cysteine protease domain was 

25 modeled, and is referred to as tiie ligand binding domain. The ligand binding domain 
thus includes the catalytic site, where substrate polypeptides axe hydrolyzed, and a 
ligand binding site, where small molecule ligacnds bind. For example, FIG. 3 shows a 
detailed map of the I7L ligand binding site. It can be seen that Gin 322, Cys 328 (the 
catalytic cysteine), Trpl68, Asnl71, Aspl94, Leu239, His241 and Asp258 line the 

30 ligand-binding site to some extent (FIG. 3). As shown in FIG. 3, the catalytic 
cysteine residue, Cys328, is located deep in the pocket. The Trp 168 side chain 
protects the Cys328 residue from Ihe solvent Table 1 lists residues that may 
comprise the I7L ligand binding and catalytic site. In an embodiment, a more detailed 



34 



wo 2005/069012 



PCT/US2004/041329 



view, showing potential intramolecular interactions such as hydrophobic bonds, salt 
bridges and Van Der Waals interactions may be generated. 

Table 1 

Predicted catalytic site residues of I7L 



Cys328, Leu 323, Leu324, His241, Ser240, Trpl68, Aspl94, Asp258, Trp242, 
Asp248, Asnl71, Serl73, Gin 322, Metl95, Gys237, Leu239, Leul77, Mei233, 

ne332, Ilel74, Lysl75, Leul98, Phe234, Glnl92, Ilel61 



5 The structure of I7L may be defined by a graphic two-dimensional figure of a 

three-dimensional model as shown in FIGS. 1-3. The representations shown in FIG. 
1-3 may also be viewed on a computer screen. When visualized on the computer, the 
models may be rotated to provide multiple views. For example, the viewer may rotate 
the model so as to provide a view that is rotated to the right or the left of the views 
10 shown in FIGS. 1-3. Or, the models depicted as FIG. 1-3 may be used to form a 
physical model. 

Additionally, the structure of the I7L protein, or a portion thereof, may be 
defined by the atomic coordinates in three dimensional space. Table 2 provides the 
three-dimensional atomic coordinates for the I7L ligand binding domain, wherein the 

IS position of each atom is defined by a unique x, y, and z coordinate in three 

dimensional space. Shown in Table 2, is the identity of the atom (column 3), the 
amino acid and residue number (cols. 4 and 5), and the actual coordinates for each 
atom in X, y, and z dimensions (cols. 6, 7, and 8, respectively. As described herein, a 
data set of structural coordinates defines the three dimensional structure of a molecule 

20 or molecules. Structural coordinates can be slightly modified and still render nearly 
identical three dimensional structures. A measure of a unique set of structuml 
coordinates is the root-mean-square deviation of the resulting structure. In alternate 
embodiments, structural coordinates that render three dimensional structures that 
deviate firom one another by a root-mean-square deviation of less than 3.0 angstroms, 

25 or less than 2.0 angstroms, or less than 0.5 angstroms, or less than 0.3 angstroms, may 
be viewed by a person of ordinary skill in flie art as identical or equivalent. 
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In SUico Screening Of Putative I7L Modulators 

The present invention further provides methods to dock compounds of 
interest, such as putative therapeutic agents, into the structure of the modeled protein 
5 to determine whether such putative therapeutic agents may interact with the protein. 
In one embodiment, the protein of interest is I7L protein, and the putative therapeutic 
agents are putative modulator compounds. For example, the modulator compounds 
may act as anti-viral agents. Thus, the putative therapeutic agents may bind to the 
ligand binding site and/or catalytic site to modify I7L activity. 

10 To generate a three dimensional model of a potential modulator compound of 

interest, or a plurality of potential modulator compounds, a database of in silica 
structures for potential modulator compounds of int^est, such as provided by 
TTProbes™, may be used. Once the three-dimensional structures of the modulator 
compounds of interest have been generated, the compounds may be docked into the 

1 5 ligand binding site of the protein of interest ^ . 

For example, in one embodiment, the site tested for interaction with potential 
modulator compounds being tested for anti-viral activity may comprise the ligand 
binding domain of I7L as described by the three-dimensional model. The amino acids 
which are assessed for interaction with the test compounds may comprise amino acids 

20 involved in catalysis, such as Cys328 of the W I7L protein. Many of the residues 
relevant for I7L catalytic activity appear to be located in the immediate vicinity of the 
ligand binding site as defined by the three-dimensional model of the present 
invention. For example, in one raabodiment, amino acids important for catatlytic 
activity are included within a 3 angstrom radius of the residues in Table 1 . 

25 Additionally or altematively, the amino acids important for catatlytic activity are 
included within a 3 angstrom radius of the catalytic cysteine, histindine, and/or 
aspartate in the catalytic triad. For example, there are several conserved amino acids, 
including Ser240, His 241, Trpl68, Tip 242, Asp 248, Asp 258, Gin 322, Cys 328, 
and Gly 329, that may be relevant for I7L catalytic activity. Also, compounds may be 

30 specifically tested for their ability to interact in silica with Cys328 as the catal3^c 
cysteine. For I7L, the amino acids assessed for putative interactions with test 
compounds may include at least some of the amino acids listed in Table 1. In one 
example embodiment, the amino acids tested for interaction with the test compound 
may comprise His 241, Tip 242, Asp 248, Asp 258, Ghi 322, Cys 328, Gly 329, 
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Leu324, Leu323, Ser240, Trpl68, Aspl94, Asnl71, Serl73, Metl95, Ser326, 
Glu327, Leu239, Leul77, Asnl99, Metl69, Phe236, ne203, and/or Met233 of the 
vaccinia vims I7L. 

The putative therapeutic agents may comprise a variety of compounds. In one 

5 embodiment, the putative therapeutic agent may comprise a peptide or a 

peptidomimetic. Or, the putative therapeutic agent may comprise an antibody. 

Alternatively, the putative therapeutic agent may comprise a small organic compound. 

FIG. 4 shows a docking mode of a small organic compound with the I7L 

ligand blading domain. The compound shown docked in the ligand binding domain 

10 of I7L is 3-hydroxy-naphthalene-2-carboxylic acid [2-(2-methoxy-4'-mtro-biphenyI- 

I 

3-yl)-ethyl]-amide (TTP-A). TTP-A is shown'as a meshed surface. It can be seen 
that one end of TTP-A makes contact with the catalytic cysteine (Cys328) and 
histidine (His241) residues and at least some of the other catalytic residues listed in 
Table 1. A similar compound, 3-(3'-chloro-4 -fluoro-biphenyl-4-yl)-2-[(4-hydroxy-4'- 
1 5 trifluoromethyl-biphenyl-3-carbonyl)-amino]-iJropionic acid methyl ester, (TTP-B), 
and other similar active analogs, make canonical contacts with active site residues of 
I7L protease. 

The structure of a putative ligand may be provided as a three-dimensional 
space-filling model, as a rotatable model on a computer screen, or as atomic 

20 coordinates in three-dimensional space. In one embodiment, the compounds that 
dock into the ligand binding site with a negative free energy are considered to be 
favorable. In alternative embodiments, a compoimd having an free energy of 
interaction with I7L (or another molecule of interest) of less than -2 kcal/mol, or less 
than — S kcal/mol, or less than -10 kcal/mole, are considered to provide favorable 

25 binding to the protein of interest. For example, Tables 3 and 4 provides the 

coordinates for several computed low-energy docking mpdes for TTP-A and TTP-B, 
respectively. For TTP-A, the energy of interaction is about -1 1 .24 kcal for all five 
docking modes. For TTP-B, the energy of interaction ranges between -8.81 kcal/mol 
to about -10.68 kcal/mol for the five low-energy docking modes, 

30 Thus, the three-dimensional coordinates as listed in Tables 3 and 4 provide the 

low energy structures of TTP-A and TTP-B, respectively, as each compound interacts 
with I7L. The low-energy docking modes for TTP-A as provided in Table 3, and for 
TTP-B as provided in Table 4, may favor interactions wifli at least some of the I7L 
residues listed in Table 1. In Tables 3 and 4, jfrom the left, the second column 
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identifies atom number, the third column identifies atom type, the fifth colunm 
identifies the docking mode (i.e., 1-5) the sixth colimm identifies the x coordinates, 
the seventh column identifies y coordinates, and the eighth column identifies the z 
coordinates. 

5 In one example embodiment, TTP-A, TTP-B, and their derivatives, bind to the 

same binding surface of the I7L model. For isxample, in the predicted docking with 
I7L, active therapeutic compounds will make favorable contacts with at least some of 
the residues shown in Table 1 . As the residues identified in Table 1 appear to be 
required for catalytic activity, it may be of importance that the putative therapeutic 

10 agent recognizes the binding surface that is described in Table 2 and at least some of 
the residues as described in Table 1 to provide the potential inhibit the cysteine 
protease activity of I7L. 

The molecular model may be furflier corroborated by studies of drug-resistant 
mutants. For example, in one embodiment, a drug-resistant virus may be isolated by 

15 passaging of the virus in the presence of the drug of interest. For example, a vaccinia 
virus passaged in the presence of TTP-A may, after several passages, result in the 
emergence of a viral strain that exhibits resistance to the inhibitory effects of TTP-A. 
The resistant virus may be isolated, and the I7L gene sequenced to determine whether 
resistance is due to a change of the I7L protein, such that the TTP-A is no longer as 

20 effective therapeutically. In one embodiment, passaging of the virus in the presence 
of TTP-A may result in a mutation of the I7L protein. For example, passage of 
vaccinia virus in the presence of TTP-A may result in mutations in certain positions of 
the protein. In alternate embodiments, there may be a Y to C mutation at position 1 04 
of the I7L protein, and an L to M mutation at 324 of the I7L protein. FIG. 5 shows a 

25 model of the I7L active site showing that the position of Leu324 is in close proximity 
to the catalytic cysteine, Cys328. 

Tables 5 and 6, list the nature of several atomic interactions for TTP-A and 
TTP-B, respectively, with atoms in the I7L protein. Thus, Tables 5 and 6, identify 
groups on I7L, as defined by Table 2, that interact with the designated atom on TTP- 

30 A or TTP-B, as defined by the first docking mode of either Table 3 or Table 4, 
respectively. By comparing the relative coordinates of the I7L residues to the 
coordinates of Ihe atoms in the first docking modes for TTP-A and TTP-B, the 
distance between the atoms and the type of interaction may be determined. The 
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structures of TTP-A and TTP-B, with the numbering of atoms for each molecule as 
used in Tables 5 and 6, are shown in FIG, 6. 

The molecular model may be used in a computational assay by which virtual 
ligands are inserted into the active site to identify those agents having the highest 
5 potential to bind to, and/or modify, the I7L activity. In a further embodiment, the 
compoimds identified by molecular modeling are tested in a biological assay. For 
example, compounds may be evaluated to determine whether the compoimd displays 
cytotoxic effects on uninfected cells. Additionally, the compound may be evaluated 
to determine the amount of compound that exhibits an inhibition of cytopathic effect 

10 (CPE) of the virus. 

The results of the determination of c5^6toxicity may be compared to the 
efiFectiveness of the compound as an anti-viral agent, to determine the therapeutic 
index (TT) of the compound- The 50% inhibitory concentrations (IC50), measured as 
the concentration of the compound that results in inhibition of the viral cytopathic 

15 effect (CPE) for 50% of treated cells, and the 50% toxicity concentration, measured as 
the concentration of the compound at which 50% of uninfected cells display signs of 
cytotoxicity (TC50), may be compared, and the therapeutic index calculated as the 
value of TC50 divided by IC50. 

• The results of the biological assay may provide further data which can be used 

20 in the next round of molecular modeling. For example, compounds that display a 
large therapeutic index may be further modified in silico to attempt to improve the 
effectiveness of the compound and then reevaluated by a biological assay. The 
process may be repeated imtil a compound maximal TI is identified. In addition, the 
compound may be further developed by animal testing and formulation of an 

25 appropriate pharmaceutical composition. 

In addition to a cell culture assay, a molecular assay of the effectiveness of the 
compounds identified by in silico screening may be performed. For example, the 
ability of a candidate compound such as TTP-A may be evaluated by determining 
whether the compound inhibits proteolysis of a I7L substrate, such as the P4b 

30 precursor protein, by I7L. Such molecular assays may provide evidence that the 
compound of interest is targeting the protein of interest to inhibit catalysijs. If 
inhibition of cleavage of the substrate is not observed, it may indicate that the 
compound identified by in silico screening is acting at a different point of the viral 
formative and/or morphogenic cycle. 
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A schematic of a method used to develop anti-viral agents is shown in FIG. 7. 
Thus, the method may include a first stage 100 of developing a three-dimensional 
model of a protein or polypeptide of interest (e.g., viral I7L). As described herein, the 
5 method may comprise providing the amino acid sequence for the protein or 

polypeptide of interest 110. The sequence of the protein or polypeptide of interest 
may then be compared to amino acid sequences available in protein sequence 
databases 120 to identify proteins or polypeptides that have a known structure, and 
that may be homologous in structure to the protein or polypeptide of interest 130. If a 

10 second polypeptide or protein of known stmcture that has a sequence that includes 
regions of identity to the protein or polypeptide of interest is identified, the second 
protein may be used to align conserved residues from the second protein or 
polypeptide with the first protein or polypeptide of interest 140. The aligned residues 
(hang-points) may then be used as anchors as the first polypeptide or protein of 

1 5 interest is threaded along the structure of the second protein or polypeptide of interest 
to construct a three-dimensional model of the first polypeptide or protein of interest 
150. 

Once a three-dimensional model of the protein or polypeptide of interest has 
been constructed, it may be used in an in silico assay for screening a plurality of 

20 compounds 200. The in slico assay may comprise generating a library of three- 
dimensional structures for potential therapeutic agents 210. For example, in one 
embodiment a library of small high information density organic molecules (i.e., a 
library, wherein each small molecule within the library contains at least one 
functional group of interest) may be prepared. Such a library is provided by 

25 TTProbes™ (TransTech Pharma., Inc., High Point, NC) which is a set of more than 
51,000 phramcophorically diverse molecules of high information density. The in 
silico probes may then be docked into the three-dimensional structure of the protein or 
polypeptide of interest as described herein to determine the atomic interactions 
between the protein/polypeptide and the compound 220. Optionally, the compound 

30 may also be modified by adding or removing molecular firagments firom the 

compound 230, and then the modified compounds docked into the three-dimensional 
structure of the protein or polypeptide of interest 240 to determine how flie changes to 
the structure of the compound may affect the interaction of the compound with the 
protein/polypeptide. Such molecular alterations may be made until there is no longer 
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an apparent improvement in the ability of the compound to interact with the 
polypeptide/protein of interest. For example, for I7L, and using the TTProbes™ in 
silico library, over 3,000 candidate potential I7.L modulators were identified. The 
method may include the option 299 of developing the compounds identified by in 
5 silico screening, or, performing further testing of the compounds by a biological 
assay. 

Thus, still referring to FIG. 7, the putative therapeutic agents (i.e., potential 
modulator compounds) identified by in silico screening may then be evaluated by 
other types of assays for biological activity 300. For example, a putative receptor 

10 ligand may be evaluated using a binding assay. For putative anti-viral agents, the 
compounds may be evaluated to determine whether they inhibit viral growth and 
propagation 310. Also, tilie compounds may be evaluated to determine whether they 
are toxic to uninfected cells 320. For example, results of such biological tests for I7L 
indicate that of the 3,460 compounds identified by in silico screening, 136 inhibit 

15 viral replication and are not toxic. Additionally, compounds may be evaluated to 
determine if they inhibit enzymatic activity of the protein of interest 330. For 
example, for I7L, the cleavage of an I7L substrate, P4b, may be measured by 
electrophoresis of proteins from cell lysates jfrom vaccinia virus-infected cells on 
SDS-PAGE gels. Treatment with TTP-A of viral infected cells results in inhibition of 

20 the cleavage of the P4b protein, as expected if TTP-A inhibits the catalytic activity of 
I7L (Byrd, CM., et al., 2004, J. Virol 78:12147-12156). 

The results of the biological testing may indicate that certain structures are of 
interest as displaying efficacy as anti-viral agents. Thus, there is the option 399 to test 
at least some of these stmctures in additional in silico assays to determine if additional 

25 chemical modifications may be made to the structures to improve the therapeutic 

effects. Or, the compounds may then considered to be optimized, and thus, comprise 

lead compounds for additional animal studies and the like 400. 

Therapeutics 

The invention further provides pharmaceutical compositions comprising the 
30 antiviral active compounds of the invention. The pharmaceutical compositions 
containing a compound of the invention may be in a form suitable for oral use, for 
example, as tablets, troches, lozenges, aqueous, or oily suspensions, dispersible 
powders or granules, emulsions, hard or soft capsules, or syrups or elixirs. 
Compositions intended for oral use may be prepared according to any known method. 
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and such compositions may contain one or more agents selected from the group 
consisting of sweetening agents, flavoring agents, coloring agents, and preserving 
agents in order to provide pharmaceutically elegant and palatable preparations. 
Tablets may contain llie active ingredient in adnadxture with non-toxic 
5 pharmaceutically-acceptable excipients which are suitable for the manufacture of 
tablets. These excipients may be for example, inert diluents, such as calcium 
carbonate, sodium carbonate, lactose, calcium phosphate or sodium phosphate; 
granulating and disiategrating agents, for example com starch or alginic acid; binding 
agents, for example, starch, gelatin or acacia; and lubricating agents, for example 

10 magnesium stearate, stearic acid or talc. The tablets may be uncoated or they may be 
coated by known techniques to delay disintegration and absorption in the 
gastrointestinal tract and thereby provide a sustained action over a longer period. For 
example, a time delay material such as gjyceiyl monostearate or glyceryl distearate 
may be employed. They may also be coated by the techniques described in U.S. 

15 Patent Nos. 4,356,108; 4,166,452; and 4,265,874, to form osmotic therapeutic tablets 
for controlled release. 

Formulations for oral lise may also be presented as hard gelatin capsules 
where the active ingredient is mixed with an inert solid diluent, for example, 
calciimi carbonate, calcium phosphate or kaolin, or a soft gelatin capsules wherein 

20 the active ingredient is mixed with water or an oil mediirai, for example peanut oil, 
liquid parafBn, or olive oil. 

Aqueous suspensions may contain the active compounds in admixture with 
excipients suitable for the manufacture of aqueous suspensions. Such excipients are 
suspending agents, for example sodium carboxymethylcellulose, methylcellulose, 

25 hydroxypropylmethylcellulose, sodium alginate, polyvinylpyrrolidone, gum 

tragacanth and gum acacia; dispersing or wetting agents may be a naturally-occurring 
phosphatide such as lecithin, or condensation products of an alkylene oxide with fatty 
acids, for example polyoxyethylene stearate, or condensation products of etihylene 
oxide with long chain aliphatic alcohols, for example, heptadecaethyl-eneoxycetanol, 

30 or condensation products of ethylene oxide witli partial esters derived from fatty acids 
and a hexitol such as polyoxyethylene sorbitol monooleate, or condensation products 
of ethylene oxide with partial esters derived from fatty acids and hexitol anhydrides, 
for example polyethylene sorbitan monooleate. The aqueous suspensions may also 
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contain one or more coloring agents, one or more flavoring agents, and one or more 

sweetening agents, such as sucrose or saccharin. 

Dispersible powders and granules suitable for preparation of an aqueous* 

suspension by the addition of water provide the active compound in admixture with a 
5 dispersing or wetting agent, suspending agent and one or more preservatives. Suitable 

dispersing or wetting agents and suspending agents are exemplified by those already 

mentioned above. Additional excipients, for example, sweetening, flavoring, and 

coloring agents may also be present. 

The pharmaceutical compositions of the invention may also be in the form of 
10 oil-in- water emulsions. The oily phase may be a vegetable oil, for example, olive oil 

or arachis oil, or a mineral oil, for example a liquid paraffin, or a mixture thereof. 

Suitable emulsifying agents may be naturally-occurring gums, for example gum 

acacia or gum tragacanth, naturally-occurring phosphatides, for example soy bean, 

lecithin, and esters or partial esters derived from fatty acids and hexitol anhydrides, 
IS for example sorbitan monooleate, and condensation products of said partial esters 

with ethylene oxide, for example polyoxyethylene sorbitan monooleate. The 

emulsions may also contain sweetening and flavoring agents. 

Also, oily suspensions may be formulated by suspending the active ingredient 

in a vegetable oil, for example arachis oil, olive oil, sesame oil or coconut oil, or in a 
20 mineral oil such as a liquid parafGn. The oily suspensions may contain a thickening 

agent, for example beeswax, hard paraffin or cetyl alcohol. Sweetening agents such 

as those set forth above, and flavoring agents may be added to provide a palatable oral 

preparation. These compositions may be preserved by the addition of an anti-oxidant 

such as ascorbic acid. 

25 Syraps and elixirs may be formulated with sweetening agents, for example 

glycerol, propylene glycol, sorbitol or sucrose; Such formulations may also contain a 
demulcent, a preservative and flavoring and coloring agents. 

The pharmaceutical compositions may also be in the form of a sterile 
injectible aqueous or oleaginous suspension. This suspension may be formulated 

30 according to the known methods using suitable dispersing or wetting agents and 

suspending agents described above. The sterile injectible preparation may also be a 
sterile injectible solution or suspension in a non-toxic parenterally-acceptable diluent 
or solvent, for example as a solution in 1,3-butanediol. Among the acceptable 
vehicles and solvents that may be employed are water. Ringer's solution, and isotonic 
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sodium chloride solution. In addition, sterile, fixed oils are conveniently employed as 
solvent or suspending medium. For this purpose, any bland fixed oil may be 
employed using synthetic mono- or diglycerides. In addition, fatty acids such as oleic 
acid find use in the preparation of injectibles. 
5 The compositions may also be in the form of suppositories for rectal 

administration of the compoimds of the invention. These compositions can be 
prepared by mixing the drug with a suitable non-irritating excipient which is solid at 
ordinary temperatures but liquid at the rectal temperature and will thus melt in the 
rectum to release the drug. Such materials include cocoa butter and polyethylene 

10 glycols, for example. 

For topical use, as for example for treatment of molluscipox virus, creams, 
ointments, jellies, solutions of suspensions, etc., containing the compounds of the 
invention are contemplated. For the purpose' of this application, topical applications 
shall include mouthwashes and gargles. 

15 The compounds of the present invention may also be administered in the form 

of liposome delivery systems, such as small unilamellar vesicles, large uinlamellar 
vesicles, and multilamellar vesicles. Liposomes may be formed from a variety of 
phospholipids, such as cholesterol, stearylamine, or phosphatidylcholines. 

Also provided by the present invention are prodrags of the invention. 

20 Pharmaceutically acceptable salts of the compounds of the present invention, 

where a basic or acidic group is present in the structure, are also included within the 
scope of the invention. The term "pharmaceutically acceptable salts" refers to non- 
toxic salts of the compounds of this invention which are generally prepared by 
reactiiig the fiee base with a suitable organic or inorganic acid or by reacting the acid 

25 with a suitable organic or inorganic base. Representative salts include the following 
salts: Acetate, Benzenesulfonate, Benzoate, Bicarbonate, Bisulfate, Bitartrate, Borate, 
Bromide, Calcium Edetate, Camsylate, Carbonate, Chloride, Clavulanate, Citrate, 
Dihydrochloride, Edetate, Edisylate, Estolate, Esylate, Fumarate, Gluceptate, 
Gluconate, Glutamate, GlycoUylarsanilate, Hexylresorcinate, Hydrabamine, 

30 Hydrobromide, Hydrocloride, Hydro^qmaphthoate, Iodide, Isethionate, Lactate, 
Lactobionate, Laurate, Malate, Maleate, Mandelate, Methanesulfonate, 
Metfaylbromide, Methylnitrate, Methylsulfate, Monopotassium Maleate, Mucate, 
Napsylate, Nitrate, N-methylglucamine, Oxalate, Pamoate (Embonate), Palmitate, 
Pantothenate, Phosphate/diphosphate, Polygalacturonate, Potassium, Salicylate, 
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Sodium, Stearate, Subacetate, Succinate, Taunate, Tartrate, Teoclate, Tosylate, 
Triethiodide, Trimefhylammoniuin and Valerate. When an acidic substituent is 
present, such as-COOH, there can be formed the ammonium, morpholiniimi, sodiimi, 
potassium, barium, calcium salt, and the like, for use as the dosage form. When a 
5 basic group is present, such as amino or a basic heteroaryl radical, such as pyridyl, an 
acidic salt, such as hydrochloride, hydrobroniide, phosphate, sulfate, trifluoroacetate, 
trichloroacetate, acetate, oxalate, male ate, private, malamute, succinct, citrate, 
tartarate, fumarate, mandelate, benzoate, ciimamate, methanesulfonate, 
ethanesulfonate, picrate and the like. Other salts, which are not pharmaceutically 

10 acceptable, may be useful in the preparation of compounds of the invention; these 
form a further aspect of the invention. 

In addition, some of the compounds identified as binding to, or modulating 
I7L, may form solvates with water or commoii organic solvents. Such solvates are 
also encompassed within the scope of the invention. 

15 Thus, in another embodiment of the present invention, there is provided a 

pharmaceutical composition comprising a therapeutically effective amount of a 
compound identified as binding to or modulating I7L, or a pharmaceutically 
acceptable salt, solvate, or prodrug thereof, and one or more pharmaceutically 
acceptable carriers, excipients, or diluents. In an embodiment of the pharmaceutical 

20 composition, the compoimd identified as binding to or modulating I7L, is an inhibitor 
of orfhopox viruses, including smallpox virus. 

In another embodiment, the present invention provides a pharmaceutical 
composition comprising a therapeutically effective amoimt of the compound 
identified as binding to or modulating I7L, and one or more pharmaceutically 

25 acceptable carriers, excipients, or diluents, wherein said pharmaceutical composition 
is used to replace or supplement compounds that posses antiviral activity. 

In another embodiment, the present invention provides a pharmaceutical 
composition comprising a therapeutically e£Eective amount of the compound ' 
identified as binding to, or modulating I7L, and one or more pharmaceutically 

30 acceptable carriers, excipients^ or diluents, and further comprising one or more 
additional therapeutic agents. 

The compound identified as binding to, or modulating I7L, may administered 
in an amount sufficient to reduce the viral load in a subject. The compound identified 
as binding to, or modulating I7L, may be administered in the form of an oral dosage 
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or parenteral dosage unit. In alternative embodiments, the compoimd identified as 
binding to, or modulating I7L, is administered as a dose in a range from about 0,01 to 
1,000 mg/kg of body weight per day, or as a. dose in a range from about 0.1 to 100 
mg/kg of body weight per day, or as a dose in a range firom about 0.5 to 10 mg/kg of 
5 body weight per day. In another embodiment, the compound identified as binding to, 
or modulating I7L, is used to replace or supplement a compound that inhibits viruses. 

The present invention also provides a prophylactic method for the inhibition of 
pox virus infection comprising administering to a subject in need thereof a compound 
identified as binding to, or modulating I7L, wherein the compoimd is administered to 

10 the subject as a phamiaceutical composition comprising a therapeutically effective 
amount of the compound and one or more pharmaceutically acceptable carriers, 
excipients, or diluents. The therapeutically effective amount of the compound 
identified as binding to, or modulating I7L may inhibit a pox virus. A therapeutically 
effective amount of the compoimd identified as binding to, or modulating I7L, may 

15 comprises an amount sufficient to achieve arid maintain a sustained blood level that at 
least partially inhibit vims growth. In altemative embodiments, the sustained blood 
level of the compound identified as modulating I7L may comprise a concentration 
ranging fi-om about 0.01 pM to 2 mM, or fi-om about 1 jiM to 300 |iM, or fi-om about 
20 pM to about 100 |jM. In another embodiment of the method, the pharmaceutical 

20 composition may further comprise one or more additional therapeutic agents. 

The following is a non-exhaustive listing of adjuvants and additional 
therapeutic agents which may be utilized in combination with the Smallpox inhibitor 
of the present invention: 

1. Analgesics: Aspirin 

25 2. NSAIDs (Nonsteroidal anti-inifiammatory drugs): Ibuprofen, 

Naproxen, Diclofenac 

■» 

3 . DMARDs (Disease-Modifying Antirheimiatic dmgs): 
Methotrexate, gold preparations, hydroxychloroquine, 
sulfasalazine 

30 4. Biological Response Modifiers: Etanercept, Infliximab, 

Glucocorticoids 

In a furrier preferred embodiment, the -present invention provides a method of 
treating or preventing viral - mediated diseases, the method comprising administering 
to a subject in need thereof, a therapeutically effective amount of a compound 
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identified as binding to, or modulating I7L, alone or in combination with therapeutic 
agents selected from the group consisting of antibiotics, hormones, biologic response 
modifiers, analgesics, NSAIDs, DMARDs, or biological response modifiers. In one 
embodiment, the viral disease is caused by an orthopox virus, such as smallpox or 
5 other orthopox viruses. 

For treatment of orthopox-mediated disease, or other viral disease, the 
compound identified as binding to, or modulating I7L, may be administered at a 
dosage level of from about 0.01 to 1000 mg/kg of the body weight of the subject 
being treated, or at a dosage range between 0.01 and 100 mg/kg, or at a dosage range 

10 between 0.5 to 10 mg/kg of body weight per day. The amount of active ingredient 
that may be combined with the carrier materials to produce a single dosage will vary 
depending upon the host being treated and the particular mode of administration. For 
example, a formulation intended for oral administration to humans may contain 1 mg 
to 2 grams of a compound identified as binding to, or modulating I7L, with an 

1 5 appropriate and convenient amount of carrier material, which may vary from about 5 
to 95 percCTt of the total composition. Dosage imit forms may, in one embodiment, 
contain between from about 5 mg to about 500 mg of active in^edient. As is known 
in the art, the dosage may be individualized by the clinician based on the specific 
clinical condition of the subject being treated. Tlius, it will be understood that the 

20 specific dosage level for any particular patient will depend upon a variety of factors 
including the activity of the specific compound employed, the age, body weight, 
general health, sex, diet, time of administration, route of administration, rate of 
excretion, drag combination and the severity of the particular disease undergoing 
therapy. 

25 EXAMPLES 
Example 1: Materials and Methods 

Small organic compound stocks were prepared at a concentration of 10 mM in 
100% dimethyl sulfoxide. The synthesis of TTP-A, TTP-B, and related compound is 
described in U.S. Patent Application 60/493,879, filed August 8, 2003 (TTP 2003- 

30 08). The disclosure of U.S. Patent Application 60/493,879, is hereby incorporated by 
reference in its entirety herein. 

Cell lines used to measure toxicity of the compounds and antiviral effects 
included BSC40 cells, which are BSCl African green monkey kidney cells adapted to 
grow at 40''C (Rac2ynsld, P., et aL, 1983, Virology, 128:458-462). The wGFP line is 
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a Western Reserve vaccinia virus with GGP in the thymidine kinase (TK) locus 
(Byrd, CM., et aL, 2004, J. Virol, 78:12147-12156). 
Example 2: Computer Modeling 

TransTech Pharma's Translational Technology™, described in U.S. Patent 
5 Applications 10/120,278, filed April 10, 2002, 10/410,965, filed April 10, 2003, and 
10/41 1,568, filed April 10, 2003, each of which are incorporated by reference in their 
entireties, was used to model the I7L cysteine protease domain, to discover specific 
small molecule inhibitors, and to optimize I7L binding agents into preclinical drug 
candidates. TransTech Phamia's Translational Technology™ was designed and 

10 developed for rapid lead generation and optimization of drug candidates. The system 
consists of two subtechnologies: TTProbes™ and TTPredict™. TTProbes™ is a set 
of greater than 51,000 pharmacologically diverse molecules. TTPredict™, is a 
computer-based technology that automates high-throughput three-dimensional target 
model building, binding site identification, and conformational analysis. The 

1 5 TTPredict computer program is used to dodk, score, and rank members of TTProbes 
set into a target binding site. 

To develop putative anti-viral compounds, TTPredict™ was used to construct 
threading and homology models for I7L. I7L is known to be a member of the 
cysteine protease super-family and has 423 amino acid residues. Sequence 

20 comparison to proteins with experimentally determined three-dimensional (3D) 
structures showed that the highest sequence identity with vaccinia virus I7L is 
achieved by the Ubiquitin-like protease 1 (ULPl) protease C-terminal domain (PDB 
- code: lEUV). Such sequence comparisons were performed using PDBBlast (available 
on-line at the NCBI web-site), 3DPSM (Bates, P.A., et al., 2001, Enhancement of 

25 Protein Modeling by Human Intervention in Applyiiig the Automatic Programs 3D- 
JIGSAW and 3D-PSSM, Proteins: Structure, Function and Genetics^ Suppl 5:39-46), 
MOE (MOE, Chemical Computing Group) (available on-line at the Chemical 
Computing Group web-site) and SeqFold within the MSI suite of programs (Accelrys 
Inc., San Diego, CA). ULPl is also a member of the cysteine protease super-family 

30 "and has 221 amino acids in the cataljrtic domain. Based on the sequence comparison, 
it was determined that ULPl has a 22% sequence identify with I7L. The 303-residue 
ligand binding domain of I7L sequence (amino acids 1 10-423) was mapped onto 301 
residues firom the C-terminus of ULPl protease domain using 3DPSM and the 
Homology modeling saitQS within the Accelrys suite of programs (San Diego, CA). 
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The sequence of the I7L polypeptide comprising the three-dimensional model of 
Table 2 s provided herein as SEQ ID NO. 1. Despite having only a 22% sequence 
identity with I7L, the 3D structure of ULPl Was successfully used as a threading 
template to generate a 3D model for the I7L query sequence. 
5 I7L and ULPl sequences were aligned in a manner that maintains perfect 

alignment of their conserved residues. In particular, their catalytic Cys ~ His - Trp 
combination from the ULPl catalytic domain were used as hang points to anchor I7L 
sequence on the 3D structure of ULPl . The threading protocols identified a 
Cys/His/Trp hang points triplets in I7L to be residues His241/Cys328/Trpl68. The 

10 corresponding triplets in ULPl protease were identified to be His514/Cys580/Trp448. 
Following threading, the Ca atoms of I7L residues were placed at the corresponding 
Ca positions of UlPl using Homology module (Accelrys, San Diego, CA). The 
resulting structure was energy minimized using Discover (Accelrys) to generate I7L 
structure that served as a model. . The hang point residues are shown in FIG. I, which 

15 also shows the locations of these residues in 17L and ULPl protease. 

Site search algorithms were used to identify the catalytic site of I7L. The 
resulting model agrees well with previous structural and biochentiical studies for 
cysteine proteases. For example, several conserved amino acids including His241, 
TTp242, Asp248, Asp258, Ghi322, Cys328, and Gly329, have been experimentally 

20 shown to be relevant for I7L catalytic activity. In the three-dimensional model 
generated for I7L, it was found that most of these residues are located in the 
immediate vicinity of the catalytic site (FIG. 2). 

Table 2 provides the coordinates for the three dimensional structure for I7L 
developed using the methods of the preset invention. In this table, from the left, the 

25 second column identifies atom number; the third identifies atom type; the fourth 

column identifies amino acid type; the fifih colunm identifies the residue number, the 
sixth colunm identifies the x coordinates, the seventh column identifies y coordinates; 
and the eighth column identifies the z coordinates. Also, shown in the ninth column 
the occupancy, and the last column of Table 2 provides the temperature fector or B 

30 factor. The B factor can be defined as: 

B=8* n*2 (<ud2> + <us2>); 
where <ud2>, is the dynamic variability, and contains information on atom variability 
in an exposed versus buried state, and the temperature dependence on variation; and 
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<us2>, is the static variability, and contains information relating to unresolved 
occupancy, altered electron density, and crystal disorder. The occupancy and B factor 
fields are not required for the analyses descrilDed herein, however. 
Example 3 In Silico Assay 
5 TTProbes were docked into the ligand binding site (FIG. 3), The fit of every 

docked probe was computed using several scoring functions. High-scoring probes 
were identified, and the highest ranking TTProbes were submitted for in vivo 
screening. 

For I7L, the amino acid residues His 241, Trp 242, Asp 248, Asp 258, Gin 

10 322, Cys 328, Gly 329, Leu324, Leu323, Ser240, Trpl68, Aspl94, Asnl71, Serl73, 
Metl95, Ser326, Glu327, Leu239, Leul77, and/or Met233 are predicted to be 
important in binding to substrates. In Table 1, additional amino acid residues that 
potentially bind to the substrate protein as well as tibiat can bind to small molecule 
ligands are listed. Amino acids shown in bold font in the Table 1 are residues that 

1 5 appear to be critical in binding to small molecule ligands. Amino acid residues that 
are not in bold also constitute the ligand binding site. For clarity, only a few amino 
acid residues are identified in FIGS. 1-5, which show the ligand binding site. 

The 51,389 probe molecules comprising TTProbes™ database were then 
docked into the catalytic site. The fit of every docked probe was computed using 

20 several scoring functions. Prior to docking the probes into I7L active site, 1000 low 
energy confoimeis per probe were generated using Monte-Carlo procedures. 
TTPredict™ was used to dock in silico every conformer into the predicted site oSlTL. 
Individual or consensus scoring fimctions including LUDI (Bohm, H.J., 1994, J, 
Camp, Aided Molec. Design, 8:243-256), PLP (Gehlhaar et al, 1995, Chem, Bio., 

25 2:317-324), DOCK (Meng, B.C., et al., 1992, J. Comp, Chem. 13:505-524), LigFit, 
(Accelrys, San Diego, CA), JAIN (Jain, A.N'; 1996, /. Comp. Aided Molec, Design 
10:427-440), and Poisson-Boltzmann (Honig, B. et al., 1995, Science, 268: 1 144-9) 
were used. High consensus scoring probes were identified and the 3,480 highest- 
ranking probes were submitted for in vitro (i.e., biological) testing. This process led 

30 to the identification of several lead compounds including, but not limited to, TTP-A 
and TIP-B. 

Tables 3 and 4 provide the coordiaates for the computed low-energy docking 
modes for TTP-A and TTP-B, respectively. Thus, the three-dimensional coordinates 
as listed in Tables 3 and 4 provide structures for TTP-A and TTP-B as each 
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compound interacts with I7L. The docking modes as provided in Tables 2 and 3 are 
presented in order of increasing energy, where a low energy associated with docking 
the compound into the TIL protein is thermodynamically more favorable than a high 
energy of interaction. The low-energy docking modes for TTP-A and TTP-B as 
5 shown in Tables 3 and 4 favor interactions with I7L residues listed in Table 1 . In 
Tables 3 and 4, from the left, the second column identifies atom number, the third 
column identifies atom type, the fourth colmiin identifies molecule name, the sixth 
column identifies the x coordinates, the seventh column identifies y coordinates and 
the eighth column identifies the z coordinates. The last column of Tables 3 and 4 
10 provides the temperature (B) factor. 
Biological Assay 

The following assay methods may be utilized to identify compounds that are 
effective in showing antiviral activity against vaccinia virus. 

a. Cytotoxicity Assay 

1 5 Cj^pathic effect was measured on the BSC40 african green monkey kidney 

cells using 100 |liM concentrations of the compounds tested in silico. In this assay, 
96-well black Packard viewplates were seeded with BSC40 cells (2.25x10'* cells/well) 
in Minimirai Essential Media supplemented with 5% PCS, 2mM L-glutamine and 10 
|Lig/mL gentamycin sulfate. When the cells became confluent (24 hrs) they were 

20 treated with 100 juM compoimd diluted in media. The cells were placed in an 
incubator at 37*'C (5% CO2) for 24 hours, and checked for toxicity via direct 
observation under the microscope and also with alamar blue which assesses cell 
viability and proliferation (healthy cells produce a visible color change from blue to 
red). The cells were scored on a scale of 0-3 where 0 corresponds to normal healthy 

25 cells, 1 corresponds to unhealthy cells but not rounding up, 2 corresponds to cells that 
are rounding up, and 3 corresponds to cells that have rounded up and pulled off the 
plate. Compounds at concentrations that scored 1 or greater were diluted and the 
above assay was repeated to find the concentration at which the compound scored 0. 
It was found that TTP-A exhibited a TC50 value of about 900 |jM, and TTP-B 

30 exhibited a TC50 value of about 600 jiM. 

b. Anti-viral assay 

A wGFP assay may be performed to test the ability of each compound to 
inhibit viral growth as measured by a reduction in fluorescence from vaccinia virus 
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expressing the green fluorescent protein (wGFP). In this assay, 96-well black 
Packard viewplates are seeded with BSC40 cells in Minimum Essential Media 
supplemented with 5% FCS, 2mM L-glutamine^ and 10 fxg/mL gentamycin sulfate. 
When the cells are confluent they are washed with PBS and then infected with 
5 vaccinia virus at a multiplicity of infection (MOI) of 0. 1 for 30 min in PBS. At 30 
minutes, the cells are overlaid with 100 |il of infection media supplemented with the 
compound of interest in doubling dilutions. As controls, infected cells are treated 
with rifampicin (to block assembly of DNA and protein into mature virus particles), 
AraC, hydroxyurea, with no compound, or mock infected. Cells arc put in a 37®C 

10 incubator (5% CO2) for 24 hrs. At 24 hours post infection (pi), the plates ate removed 
from the incubator, washed with PBS and fluorescence measured on a Wallac plate 
reader (using an excitation of 485 nm and reading at 535 nm). Wells that show 
reduced fluorescence are checked visually under the microscope to verify a reduction 
in viral infection versus a loss of cells due to cjrtopathic effect from vims infection. 

1 5 Compounds that are found to inhibit viral replication are then checked for inhibitory 
effect at various concentrations to determine the IC50 and the therapeutic index. It 
was found that TTP-A exhibited a IC50 value of about 12 (jM, and TTP-B exhibited a 
IC50 value of about 4.6 |jM. 

c. Determination of TI 

20 The 50% inhibitor concentrations (IC50) were determined by cytopathic effect 

(CPE) inhibition as seen by fluorescence using wGFP and plaque reduction assays 
with crystal violet staining or neutral red uptake. The 50% cell toxicity concentration 
(TC50) were determined as the concentrations of compounds that caused 50% of the 
cells to round up and show signs of toxicity both visibly and by the Alamar Blue dye 

25 assay. The thempeutic index was calculated as the value for TC50 divided by IC50. 
For TTP-A, a TI of about 75 was calculated; For TTP-B, a TI of about 130 was 
calculated. 

Examples: Drug-Resistant Viruses 

To demonstrate that the target of TTP-A mediated inhibition is I7L protein, 
30 wGFP was subjected to numerous passages in the presence of TTP-A to generate 
durg-resistant viral mutants (Byrd, CM., et al, 2004, J. Virol. 78:12147-12156), 
Cells were infected with wGFP at an MOI of 0.1 in the presence of the IC50 
concentration of TTP-A for 24 h prior to being harvested. After determinining the 
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titer, a portion of the virus-infected cell extract was used to infect firesh BSC40 cells. 
The titer of virus dropped seven logs from passage 0 to 4. Starting with passage 5, the 
progeny titer began to rise in the presence of the drug until a four log increase was 
observed by passage 7, presumably due to the emergence of a drug-resistant mutant 
5 population. After passage 9, individual viral plaques were purified, and the viral 
DNA isolated and sequenced. All of the resistant viruses were found to have 
mutations in positions 104 and 324, with a Y to C mutation at 104, and an L to M 
mutation at 324. FIG. 5 shows a model of the I7L active site showing the position of 
Leu324 in close proximity to the catalytic cysteine, Cys328. 

10 While the invention has been described and illustrated with reference to 

certain preferred embodiments thereof, those skilled in the art will appreciate that 
various changes, modifications and substitutions can be made therein without 
departing from the spirit and scope of the invention. For example, effective dosages 
other than the preferred dosages as set forth herein may be applicable as a 

1 S consequence of variations in the responsiveness of fhe mammal being treated for 
orthopox-mediated disease(s). Likewise, the specific pharmacological responses 
observed may vary according to and depending on the particular active compound 
selected or whether there are present pharmaceutical carriers, as well as the type of 
formulation and mode of administration employed, and such expected variations or 

20 differences in fhe results are contemplated in accordance with the objects and 
practices of the present invention. 
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TABLE -2 



REMARK Model of I7L cysteine protease domain 
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ATOM 2042 CA PHE 368 
ATOM 2043 CB PHE 368 
ATOM 2044 CG PHE 368 
ATOM 2045 CDl PHE 368 
ATOM 2046 CD2 PHE 368 
ATOM 2047 CEl PHE 368 
ATOM 2048 CE2 PHE 368 
ATOM 20 49 CZ PHE 368 
ATOM 2050 C PHE 368 
ATOM 2051 0 PHE 368 
ATOM 2052 N LYS 369 
ATOM 2053 CA LYS 369 
ATOM 2054 CB LYS 369 
ATOM 2055 CG LYS 369 
ATOM 2056 CD LYS 369 
ATOM 2057 CE LYS 369 
ATOM 2058 NZ LYS 369 
ATOM 2059 C LYS 369 
ATOM 2060 O LYS 369 
ATOM 2061 N SER 370 
ATOM 2062 CA SER 370 
ATOM 2063 CB SER 370 
ATOM 2064 OG SER 370 
ATOM 2065 C SER 37 0 
ATOM 2066 O SER 37 0 
ATOM 2067 N ILE 371 
ATOM 2068 CA ILE 371 
ATOM 2069 CB ILE 371 
ATOM 2070 CG2 ILE 371 
ATOM 2071 CGI ILE 371 
ATOM 2072 CDl ILE 371 
ATOM 2073 C ILE 371 
ATOM 2074 O ILE 371 
ATOM 2075 N LEU 372 
ATOM 2076 CA LEU 372 
ATOM 2077 CB LEU 372 
ATOM 2078 CG LEU 372 
ATOM 2079 CD2 LEU 372 
ATOM 2080 CDl LEU 372 
ATOM 2081 C LEU 372 
ATOM 2082 0 LEU 372 
ATOM 2083 N PHE 373 
ATOM 2084 CA PHE 373* 
ATOM 2085 CB PHE 373 
ATOM 208 6 CG PHE 373 
ATOM 2087 CDl PHE 373 
ATOM 2088 CD2 PHE 373 
ATOM 2089 CEl PHE 373 
ATOM 2090 CE2 PHE 373 
ATOM 2091 CZ PHE 373 
ATOM 2092 C PHE 373 
ATOM 2093 O PHE 373 
ATOM 2094 N ASN 374 
ATOM 2095 CA ASN 374 
ATOM 2096 CB ASN 374 
ATOM 2097 CG ASN 374 
ATOM 2098 ODl ASN 374 
ATOM 2099 ND2 ASN 374 
ATOM 2100 C ASN 37 4 
ATOM 2101 O ASN 374 
ATOM 2102 N LEU 375 
ATOM 2103 CA LEU 375 
ATOM 2104 CB LEU 375 
ATOM 2105 CG LEU 375 
ATOM 2106 CD2 LEU 375 
ATOM 2107 CDl LEU 375 
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■ boo 
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26 . 
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. 514 


27 , 
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29 


,223 


32 


A 




.367 


32 


. 098 




. 975 








, 549 


33 


.162 


•J o 


. 448 


33 


.710 


34 


.714 


34 


.387 


35 


.523 


34 


.315 


32 


.775 


34 


.474 


32 


.884 


35 


.154 


32 


.052 


36 


.307 


31 


.439 


37 


.271 


" 30 


.824 


36 


.641 


29 


.548 



38.620 30.560 
39.348 31.834 
35.882 30.387 
36.477 30.250 
34.838 29.605 
34.451 28.565 
33.292 27.682 
33.737 26.614 
32.585 25.. 650 
34.372 27.235 
34.060 29.165 

34.364 28.606 

33.365 30.314 
32.945 30.855 
31.996 32.050 
31.471 32.113 
30.661 31.089 
31.779 33.147 
30.154 31.086 
31.269 33.146 
30.458 32.118 
34.131 31.280 

34.206 30.989 
35.105 31.971 
36.247 32.451 

37.207 33.269 
36.483 34.537 
35.692 35.105 
36.758 34.995 
37.011 31.277 
37.540 31.315 
37.099 30.205 
37.815 29.003 
37.964 28.066 
38.995 26.926 
40.379 27.498 
38.548 25.833 



80.153 1.00 59.75 
81.136 1.00 59.75 
80.529 1.00 59.75 

80.426 1.00 59.75 
80.091 1.00 59.75 
79.873 1.00 59.75 
79.538 1.00 59.75 

79.427 1.00 59.75 
78.886 1.00 59.75 
77.853 1.00 59,75 
78.918 1.00110.95 
77.736 1.00110.95 
77.952 1.00110.95 
76.683 1.00110.95 
76.907 1.00110,95 
75.656 1.00110.95 

75.183 1.00110.95 
76.659 1.00110.95 
75.494 1.00110.95 
77,028 1.00 75.96 
76.078 1.00 75.96 
76.757 1.00 75.96 
75.810 1.00 75,96 
75.399 1.00 75.96 

74.184 1.00 75.96 
76,163 1.00124.77 
75.569 1,00124.77 
76.546 1.00124.77 
77.119 1.00124.77 
75.853 1.00124.77 
75.428 1.00124.77 
74.597 1.00124.77 
73.530 1,00124.77 
74.927 1.00105.73 
74.019 1.00105.73 
74.510 1.00105.73 
75.522 1.00105.73 
75.845 1.00105.73 
76.772 1.00105.73 
72.714 1.00105.73 
71.662 1.00105.73 
72.732 1.00 64.70 
71.478 1.00 64.70 
71.593 1.00 64:70 
70.211 1.00 64.70 
69.784 1.00 64.70 
69.361 1.00 64.70 
68.513 1.00 64.70 
68.086 1.00 64.70 
67.664 1.00 64.70 
70.664 1.00 64.70 
69.472 1.00 64.70 
71.286 1.00 35.72 
70.559 1.00 35.72 
71.437 1.00 35.72 
71.865 1.00 35.72 

71.114 1.00 35.72 

73.115 1.00 35.72 
70.026 1.00 35.72 
68.917 1.00 35.72 
70.833 1.00138.95 
70.512 1.00138.95 
71.731 1.00138.95 
71.570 1.00138.95 
71.223 1.00138.95 
70.589 1.00138.95 
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END 









375 37.093 28.291 

375 37.711 27.552 

376 35.766 28.517 
376 34.901 27.828 
376 36.992 27.064 
376 36.407 28.177 
376 35.031 28.206 
376 38.477 28.713 
376 37.327 29.175 
376 38.228 27.440 
376 35.014 26.360 

376 35.402 25.581 

377 34.645 25.976 
377 34.668 24.635 
377 34.286 24.519 
377 35.531 24.786 
377 36.651 24.598 
377 35.380 25.159 
377 33.737 23;760 

377 33.945 22.551 

378 32.671 24.325 
378 31.748 23.489 
378 30.589 24.271 
378 29.593 24.813 
378 30,294 25.697 
378 28,792 23.673 
378 32.477 22.779 

378 ' 32.211 21.606 

379 33.403 23.460 
379 34.122 22.754 
379 34.647 23.656 
379 35.608 24.576 
379 35.317 22.106 
379 36.181 22.858 
379 35.380 20.848 
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ATOM 
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ATOM 

ATOM 

ATOM 

ATOM 
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ATOM 

END 

REMARK 

REMARK 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 



TABLE- 3 

low-energy 
Binding 



TTP-A Computed 
Free Energy of 

1 C9 MOL 1 

2 C4 MOL 1 

3 C12 MOL 1 

4 CI MOL 1 

5 C3 MOL 1 

6 C7 MOL 1 

7 CI 4 MOL 1 

8 CI 6 MOL 1 

9 N17 MOL 1 

10 H17 MOL 1 

11 CI 8 MOL 1 

12 02 0 MOL 1 

13 C19 MOL 1 

14 C22 MOL 1 

15 025 MOL 1 

16 H25 MOL 1 

17 C24 MOL 1 

18 C30 MOL 1 

19 C28 MOL 1 

20 C27 MOL 1 

21 C29 MOL 1 

22 C26 MOL 1 

23 C23 MOL 1 

24 C21 MOL 1 

25 C2 MOL 1 

26 C6 MOL 1 

27 Oil MOL ) 1 

28 CIS MOL 1 

29 Nl MOL 1 

30 02 MOL 1 

31 03 MOL 1 

32 CIO MOL 1 

33 C5 MOL 1 

34 08 MOL .1 

35 C13 MOL 1 

TTP-A Computed 
Free Energy of 

1 C9 MOL 

2 C4 MOL 2 

3 CI 2 MOL 2 

4 CI MOL 2 

5 C3 MOL 2 

6 C7 MOL 2 

7 C14 MOL 2 

8 CI 6 MOL 2 

9 N17 MOL 2 

10 H17 MOL 2 

11 CI 8 MOL 2 

12 02 0 MOL 2 

13 C19 MOL 2 

14 C22 MOL 2 

15 025 MOL 2 

16 H25 MOL 2 

17 C24 MOL 2 

18 C30 MOL 2 

19 C28 MOL 2 

20 C27 MOL 2 

21 C29 MOL 2 

22 C26 MOL 2 

23 C23 MOL 2 

24 C21 MOL 2 



docking mode-1 
= -11.24 kcal/mol 

14.331 35.024 76.250 

15.175 35.446 77.272 

14.648 35.356 74.948 

16.340 36.195 77.047 

16.661 36.522 75.696 

15.793 36.088 74.698 

13.069 34.215 76.522 

12.147 35.063 77.449 

10.746 34.939 77.134 

10.090 34.523 77.810 

10.294 35.400 75.880 

11.112 35.909 75.103 

8.875 35.299 75.453 

8.475 34.568 74.332 

9.402 33.882 73.533 

9.655 33.026 73.978 

7.129 34.514 73.990 

3.884 35.791 75.169 

•4.833 35.121 74.403 

5.623 36.575 76.636 

4.281 36.517 76.287 

6.177 35.180 74.753 

6.572 35.908 75.868 

^ 7.916 35.963 76.214 

17.063 36.529 78.390 

17.552 35.601 79.301 
18.194 35.978 80.472 
18.364 3.7.322 80.761 
19.005 37.698 81.944 
18.437 38.366 82.727 
20.246 37.308 82.183 
17.889 38.277 79.867 
17.251 37.877 78.694 
17.778 37.255 75.046 
17.743 38.555 74.504 



low-energy 
Binding 
2 14. 

15 
14 
16 
16 
15 
13 
12 
10 
10 



6 



8 



docking mode- 2 
= -11.24 kcal/mol 

546 35.957 75.646 

319 36.210 76.776 

848 36.618 74.473 

,397 37.106 76.783 

.705 37.776 75.560 

.909 37.503 74.451 

377 34.979 75.667 

.063 35.802 75.507 

.877 34.985 75.452 

.840 34.151 74.849 

.769 35.352 76.244 

,861 36.351 76.969 

.495 34.588 76.243 

.581 34.648 75,187 

.816 35.443 74.057 

.798 35,512 73.896 

.408 33.905 75.251 

.690 31.567 77.516 

.959 32.366 76.409 

774 32.243 78.510 

599 31.506 78.566 

135 33.104 76.354 

042 33.044 77.405 

216 33.783 77.344 



-0.49 
-0.50 
-0.41 
-0.52 
-0.49 
-0.42 
-0.35 
-0.61 
-0.19 
-0.19 
-0.47 
+0.04 
-0.39 
-0.31 
-0.10 
-0.14 
-0.26 
-0.09 
-0.13 
-0.25 
-0.15 
-0.18 
-0.27 
-0.39 
-0.54 
-0.75 
• -0.84 
-0.83 
-0.59 
-0.29 
-0.77 
-0.73 
-0.60 
+0.02 
-0.36 



-0.41 
-0.43 
-0.33 
-0.49 
-0.43 
-0.35 
-0.40 
-0.40 
-0.12 
+0.28 
-0.52 
-0.12 
-0.49 
-0.31 
+ 0.13 
+0.11 
-0, 



.29 
-0.33 
-0.31 
-0.59 
-0.47 
-0.36 
-0.54 
-0.53 
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75 
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+0. 


00 
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-0. 


29 


END 
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+ 0. 
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+ 0. 


10 
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72 
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-0. 
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-0. 


38 
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3 
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74 
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-0. 


24 


ATOM 
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3 


11.629 
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76 


.092 


-0. 


38 


ATOM 
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3 


10.587 


35.572 


76 


.478 


-0. 


54 


ATOM 
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3 


11.181 


35.897 


73 


.780 


-0. 


26 


ATOM 


23 


C23 


MOL 




3 


11.925 


36.568 


74 


.743 


-0. 


30 


ATOM 
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C21 


MOL 




3 


12.967 


37.399 


74 


.351 


-0. 


25 


ATOM 
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76 
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.791 
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3 
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78 


.044 


-0. 


15 


ATOM 


35 
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3 
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79 
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69 


END 
























REMARK 


TTP' 


-A Computed 


low- 


-energy docking mode- 


-4 








REMARK 


Free Energy 


of 


Binding = 


•11.24 kcal/mol 






ATOM 


1 


C9 


MOL 




4 


10. 690 


36. 976 


72 


.708 


-0. 


14 


ATOM 


2 


C4 


MOL 




4 


9.929 


36.252 


73 


.621 


-0. 
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19 


C28 


MOL 


4 


20 


C27 


MOL 


4 


21 


C29 


MOL 


4 


22 


C26 


MOL 


4 


23 


C23 


MOL 


4 


24 


C21 


MOL 


4 


25 


C2 


MOL 


4 


26 


C6 


MOL 


4 


27 


Cll 


MOL 


4 


28 


C15 


MOL 


4 


29 


Nl 


MOL 


4 


30 


02 


MOL 


4 


31 


03 


MOL 


4 


32 


CIO 


MOL 


4 


33 


C5 


MOL 


4 


34 


OS 


MOL 


4 


35 


C13 


MOL 


4 



16 . 


«l o ^ 


.J u • 


781 


73. 


342 


-0. 


21 


15 - 




36 . 


835 


72 . 


698 


-0. 


17 


J. u ■ 


O f o 


OO m 


•I? X W 


75. 


681 


-0. 


50 


16 . 






223 


79. 


197 


-0. 


76 


17 . 


«• x^ 




870 


78. 


027 


-0. 


58 


1 R 
j.^ ■ 




Ofl • 


174 


78. 


022 


-0. 


74 


15 . 


729 


34 . 


375 


79. 


194 


-0. 


69 








667 


76. 


854 


-0. 


55 




, J3 f 


o *± • 


,818 


76. 


851 


-0. 


.55 


1 4 

X 1 « 


683 


«j 1 « 


. 621 


75. 


.676 


-0. 


.45 


8 


> ^ / X 


35 . 


.757 


75, 


,595 


-0. 


.37 


7 


• X 


O-J - 


. 599 


75. 


.566 


-0. 


.28 


Ci 

D 


• ^ 1 \j 


34 


. 626 


76 


.311 


-0 


.35 


7 


.275 


33 


.771 


77 


.117 


-0 


.52 


6 


.615 


32 


.797 


77 


.870 


-D 


.34 


6 


.313 


33 


.026 


78 


.983 


^0 


.12 


6 


.333 


31 


.619 


77 


.337 


-0 


.05 


8 


.659 


33 


. 901 


77 


,165 


-0 


.44 


9 


.294 


34 


.882 


76 


.405 


-0 


.52 


9 


.280 


39 


.141 


76 


.118 


-0 


.03 


B 


.017 


39 


.717 


76 


.363 


-0 


.25 



TTP-A Computed 
Free Energy of 

1 C9 MOL 

2 C4 MOL 

3 C12 MOL 

4 CI MOL 

5 C3 MOL 

6 C7 MOL 

7 CI 4 MOL 

8 CI 6 MOL 

9 N17 MOL 

10 H17 MOL 

11 C18 MOL 

12 O20 MOL 

13 C19 MOL 

14 C22 MOL 

15 025 MOL 

16 H25 MOL 

17 C24 MOL 

18 C30 MOL 

19 C28 MOL 

20 C27 MOL 

21 029 MOL 

22 026 MOL 

23 023 MOL 

24 021 MOL 

25 02 MOL 
2 6 06 MOL 

27 Cll MOL 

28 C15 MOL 

29 Nl MOL 

30 02 MOL 

31 03 MOL 

32 CIO MOL 

33 C5 MOL 

34 08 MOL 

35 C13 MOL 



low-energy docking mode-5 
Binding = -11.24 kcal/mol 

105 36.487 73.198 -0.23 

920 36.000 72.657 -0.23 

046 37.547 74.081 -0.23 

659 36.532 72.965 -0.18 

612 37.621 73.887 -0.18 

817 38,087 74.405 -0.18 

463 35.893 72.844 -0.13 

,872 34.918 73.989 -0.37 

.745 35.515 74.967 -0.28 

.727 35.724 74.738 -0.00 

.225 35.808 76.245 -0.45 

.035 35.558 76.475 -0.04 

.049 36.404 77.329 -0.46 

.163 3^7.784 77.517 -0.33 

.500 38.696 76.683 -0.02 

531 38.465 76.641 +0.09 

.951 38.273 78.552 -0.62 

.086 37.030 81.280 -0..78 

.412 37.904 80.433 -0,77 

.187 35.158 80.063 -0.78 

972 35.656 81.094 -0.67 

.626 37.405 79.401 -0.59 

515 36.032 79.216 -0.67 

727 35.539 78.184 -0.65 

528 35.786 72.190 -0.19 

139 36.037 70,880 -0.13 

.127 35.322 70,256 -0.06 

.470 34.317 70.946 -0.22 

448 33.603 70.316 -0.41 

.410 34.120 70.120 +0.13 

642 32.348 69.943 -0.23 

838 34,037 72.258 -0.34 

860 34.764 72.865 -0.31 
514 38.430 74.478 . +0.14 

926 38.269 75,749 -0.34 



5 

5 

5 

5 

5 

5 

5 

5 

5 

5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 
5 



13. 
11. 
13, 
10 
10 
11 
14 
14 
15 
16 
15. 
14. 
16. 
16. 
15. 
14, 
16. 
19. 
18, 
18 
18 
17 
17 
16 
9 
9 
8 
7 
6 
5 
6 
7 
8 
9 
8 
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Table-4 



PCTAJS2004/041329 



REMARK 4 ^„^^rr^/ of binding = -10 . 68kcal/iaol 

REMARK Computed Free energy of ^inaing 
REMARK TTP-B Computed low-energy docking_moae^ 



ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
END 

REMARK 



1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 

41 



CI 
C3 
C4 
C7 
C8 



TTB 
TTB 
TTB 
TTB 
TTB 



C12 TTB 
C2 TTB 



C5 
C9 
C13 
C 
F2 
F3 
F4 



TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
CIO TTB 
C6 TTB 
Cll TTB 
015 TTB 
N14 TTB 
C17 TTB 
CI 8 TTB 
C20 TTB 
C23 TTB 
C26 TTB 
C28 TTB 
C29 TTB 
C30 TTB 
C32 TTB 
C34 TTB 
F36 TTB 
C33 TTB 
C31 TTB 
C27 TTB 
C24 TTB 
CI 9 TTB 
022 TTB 
021 ,TTB 
C25 TTB 
016 TTB 
HI 4 TTB 
HI 6 TTB 



1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 



16.460 
15.068 
17.148 
14.360 
16.447 
15.058 
17.213 
17.535 
18.241 
18.621 
19.378 
19.510 
20.635 
18.761 
18.301 
17.596 



12 
12 
12 
10. 
10. 

9. 

7 . 

6. 



874 
223 
209 
796 
270 
,070 
.858 
751 
6.852 
5.684 
4.745 
3.634 
3.460 
2.387 
4.397 
5.510 
8.062 
9.168 
10.463 
11.083 
9.471 
9.019 
14.394 
12.755 
14.333 



36.607 
36.607 
35.418 
35.417 
34.230 
34.223 
37.852 
38.651 
39.834 
40.220 
41.498 
42.182 
41.175 
42.258 
39.422 
38 .240 
35.461 
34.713 
36.345 
36.441 
35.386 
34.590 
34.704 
33.978 
33.133 
32.375 
32.975 
32.260 
30.940 
30.249 
30.339 
31.056 
33.011 
33.737 
37.836 
38.357 
38.542 
39.626 
33.003 
36.931 
32.541 



76.845 
76.816 - 
77.039 - 
76.974 - 
77.202 - 
77.169 - 
76.663 - 
77.755 - 
77.565 - 
76.284 ■ 
76.076 ■ 
77.268 • 
75.610 ■ 
75.105 
75.193 
75.383 
76,923 
77.662 
056 
065 
053 
558 
888 
,314 
.413 
857 
688 
113 
709 
123 
,879 
.451 
77 .085 
76.656 



76. 

76. 

75. 

75. 

74. 

75< 

76. 

76. 

77. 

78. 

77. 

78. 

76. 

76, 



75 
74 
76 
75 
77 
75 



,633 
.697 
.299 
.493 
.335 
.407 



76.458 



0.00 
0.00 
0.00 
0.00 
•0.00 
•0.00 
•0.00 
■0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
0.00 
-0.00 
-0.00 
-0.00 
0.00 
0.00 



0.51 
0.27 
0.61 
0.51 
0.66 
0.63 
0.48 
0.51 
0.51 
0.44 
0.34 
0.28 
0.35 
0.12 
0.36 
0.40 
0.57 
0.12 
0.26 
0.46 
0.34 
0.42 
0.30 
0.29 
0.37 
0.37 
0.28 
0.21 
0.27 
0.18 
0.43 
0.47 
0.43 
0.47 
0.21 
0.09 
0.06 
0.28 
0.15 
0.09 
0.03 



TTP-B computed ^^w-energy doc)cing m^^^ 
REMARK computed Free energy of bxn^^^^ ^^9.5^ ^^^^^ 

ATOM 1 CI TTB ^ . ^ ^.oo -^c: -700 74.231 



ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 



8 
9 
10 
11 
12 
13 
14 
15 
16 



C3 

C4 

C7 

C8 

C12 

C2 

C5 

C9 

C13 

C 

F2 

F3 

F4 



TTB 

TTB 

TTB 

TTB 

TTB 

TTB 

TTB 

TTB 

TTB 

TTB 

TTB 

TTB 

TTB 



CIO TTB 
C6 TTB 



10.922 
10.133 
9.674 
8.888 
8.654 
12.456 
12.595 
13.838 
14 .940 
16,280 
17 .004 
16.070 
16.951 
14.803 
13.562 



35.738 
36.554 
35.983 
36.804 
36.520 
35.751 
34.762 
34.526 
35.280 
35.031 
36.205 
34.559 
34.050 
36.268 
36.503 



74.231 
76.352 
73.662 
75.789 
74.449 
76.186 
77.154 
77.732 
77.344 
77.970 
78.046 
79.248 
77.273 
76.377 
75.799 



-0. 
-0. 
-0, 
-0, 
-0. 
-0. 



00 
,00 
.00 
.00 
.00 
00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 



0.41 

0.28 

0.50 

0.22 

0.37 

0.20 

0.49 

0.54 

0.65 

0.53 

0.65 

0.44 

0.54 

0.52 

0.42 

0.37 
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ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

END 

REMARK 

REMARK 

REMARK 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM. 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 



17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 

39 

40 

41 

4 



Cll 
015 
N14 
C17 
CI 8 
C20 
C23 
C26 
C28 
C29 
C30 
C32 
C34 
F36 
C33 
C31 
C27 
C24 
C19 
022 
021 
C25 
016 
HI 4 
H16 



TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 
TTB 



2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 

2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 
2 



9.471 
9.001 
9.794 
9.527 
10.801 
11.963 
12.818 
13.876 
14.084 
15.200 
16.254 
17.316 
17,326 
18.351 
16.273 
15.209 
13.232 
12.173 
8.350 
8.431 
.170 
.157 
.384 
10.230 
6.855 



7- 
6, 

7. 



35.654 

34.550 

36.582 

36.296 

36.688 

35.716 

35.881 

35.001 

33.950 

33.030 

32.999 

32.127 

31.282 

30.438 

31.311 

32.185 

33.776 

34.657 

37 .140 

38.373 

36.528 

36.738 

36.786 

37 .479 

35.946 



72.225. • 

71.921 

71.220 

69.859 

69.061 

69.239 

70.323 

70.522 

69.636 

69.849 

68.944 

69.141 

70.245 

70.433 

71.152 

70.955 

68 .553 

68.357 

69.473 

69.525 

69.074 

70.054 

73.924 

71.476 

73.905 



•0.00 
•0.00 



-0, 

~0 

-0 

-0 

-0 

-0 

-0 

-0 



00 
00 
00 
,00 
.00 
.00 
.00 
.00 



-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
0.00 
0.00 
-0.00 
0.00 
0-00 
0.00 



0.20 

0.23 

0.12 

0.12 

0.11 

0.17 

0.22 

0.30 

0.31 

0.40 

0.39 

0.48 

0.62 

0.52 

0.62 

0.51 

0.06 

0.15 

0.07 

0.23 

0.20 

0.04 

0.19 

0.12 

0.11 



TTP-B Computed low-energy docking 
Computed Free energy of ^i^^^^g - 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 



CI 
C3 
C4 
C7 
C8 



TTB 
TTB 
TTB 
TTB 
TTB 



3 
3 



C12 TTB 
C2 TTB 



C5 
C9 



TTB 
TTB 



C13 TTB 



C 
F2 
F3 
F4 



TTB 
TTB 
TTB 
TTB 



ClO TTB 
C6 TTB 
Cll TTB 
015 TTB 
N14 TTB 
C17 TTB 
C18 TTB 
C20 TTB 
C23 TTB 
C26 TTB 
C28 TTB 
C2 9 TTB 
C30 TTB 
C32 TTB 
C34 TTB 
F36 TTB 
C33 TTB 
C31 TTB 
C27 TTB 
C24 TTB 
CI 9 TTB 
022 TTB 
021 TTB 



16.497 
15.398 
16.865 
14.666 
16.137 
15.041 
17.281' 
17.913 
18-662 
18.782 
19.594 
19.158 
20.905 
19.556 
18.150 
17.402 
13.506 
13.425 
12.506 
11.364 
11.085 
12.345 
13.086 
14.229 
14.638 
15.842 
15.774 
16.915 
18.128- 
19.230 
18.198 
17.055 
13.900 
12.756 
10.246 
9.666 
9.881 



mode- 3 

-9.72kcal/mol 
36.818 77.723 



36.382 
36.149 
35.272 
35.044 
34.602 
37.975 
37 .972 
39.074 
40.175 
41.362 
42.503 
41.128 
41.491 
40.181 
39.081 
34.832 
33.646 
35.744 
35.307 
36.377 
36.921 
36.082 
36.554 
37,870 
38.359 
38.788 
39.253 
39.288 
39.736 
38.860 
38.395 
38.716 
38.242 
35.227 
36.259 
33.994 



76.989 
78.882 - 
77.408 - 
79.305 - 
78.574 - 
77.281 • 
76.042 - 
75.642 - 
76.482 • 
76.055 
76.698 
76.411 
74 .683 
77 .719 
78.118 
76.586 
76.245 
76.206 
75.490 
74.398 
73.732 
■ 72.908 
72.273 
72.458 
71.790 
70.470 
69.830 
70.510 
69.889 
71.829 
72.471 
73.277 
73.910 
76.485 
76.846 
77.008 



-0.00 
-0.00 



•0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0-00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0-00 
-0.00 
-0-00 
-0.00 
-0.00 



0.50 
0.37 
0.68 
0.52 
0.74 
0.72 
0.43 



20 
,26 
.47 



0.37 

0.23 

0.39 

0.18 

0.43 

0,61 

0.54 

0.14 

0.31 

0.35 

0.27 

0.22 

0.25 

0.15 

0.18 

0.11 

0.13 

0.14 

0.22 

0.17 

0.32 

0.25 

0.16 

0,19 

0.52 

0.11 

0.17 
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ATOM 
ATOM 
ATOM 
ATOM 
END 
REMARK 
REMARK 
REMARK 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
ATOM 
END 

REMARK 

REMARK 

REMARK 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 



38 C25 TTB 

39 016 TTB 

40 H14 TTB 

41 H16 TTB 



3 
3 
3 
3 



8.461 33,878 77.034 

14.339 33.484 79,037 

12,604 36.740 76.451 

13.868 33.718 79.880 



-0.00 0.54 

-0.00 0.10 

0.00 0.07 

-0.00 0.28 



TTP-B computed low-energy docking "^°J^:5j._i/^ol 
computed Free energy of b.ndxng = - 231ccal/mo 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 



CI 
C3 
C4 
C7 
C8 



TTB 
TTB 
TTB 
TTB 
TTB 



C12 TTB 
C2 TTB 



C5 
C9 



TTB 
TTB 



C13 TTB 
C TTB 



F2 
F3 
F4 



TTB 
TTB 
TTB 
CIO TTB 
C6 TTB 
Cll TTB 
015 TTB 
N14 TTB 
C17 TTB 
C18 TTB 
C20 TTB 
C23 TTB 
C26 TTB 
C28 TTB 
C29 TTB 
C30 TTB 
C32 TTB 
C34 TTB 
F36 TTB 
C33 TTB 
C31 TTB 
C27 TTB 
C24 TTB 
C19 TTB 
022 TTB 
021 TTB 
C25 TTB 
016 TTB 
H14 TTB 
HI 6 TTB 



4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 
4 



17.143 
17.445 
16.386 
16.986 
15.929 
16.224 
17.609 
16.917 
17.367 
18.503 
18.985 
17.946 
19.942 
19,593 
19.194 
18.747 
17.321 
18.340 
16.501 
16.864 
15.710 
14 .449 
. 13.472 
12.320 
12.138 
10.923 
9.795 
8.638 
8.609 
7.491. 
9,736 
10.895 
13.112 
14.265 
17.003 
16.055 
18.203 
18.897 
15,741 
15.628 
14.949 



39.598 
38.517 
39.404 
37.239 
38.132 
37.051 
40.947 
41.727 
43.009 
43.512 
44.897 
45. 676 
44.796 
45.448 
42.733 
41.453 
36.115 
35.448 
35.815 
34.795 
33.755 
34 .240 
34 .897 
35.366 
35.180 
35.673 
34 .865 
35.328 
36.601 
37 .046 
37 .411 
36.948 
34.528 
34.061 
35.456 
36.094 
35.360 
36.603 
35.788 
36.342 
35.568 



75.386 
76.209 
74.239 
75.892 
73.917 
74.737 
75.720 - 
76.641 - 
76.940 - 
76.315 - 
76.631 - 
77.099 - 
77.619 - 
•75.524 - 
75.396 • 
75.098 ■ 
76.808 ■ 
76.594 
77.909 
78.823 
78.814 
78.105 
78.844 
78.223 
76.857 
76.211 
76.154 
75.541 
74.983 
74.388 
75.039 
75.654 
76.111 
76.735 
80.161 
80.636 
80.851 
80.797 
74.375 
78.054 
74.931 



0.00 
0.00 
0.00 
■0.00 
-0.00 
-0.00 
-0.00 
-0-00 
-0.00 
-0.00 



0.31 
0.44 
0.23 
0.48 
0.28 
0.41 
0.27 



00 
00 
00 
00 
00 
00 
00 
0.00 
•0,00 
•0.00 
•0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0,00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
-0.00 
0.00 



29 
16 
18 
,12 
,09 
.15 
.03 
0.16 
0.22 
0.49 
0.20 
0.35 
0.70 
0.52 
0.72 
0.25 
0.39 
0.61 
0.53 
0.51 
0.39 
0.25 
0.10 
0.29 
0.35 
0.36 
0.58 
0.75 
0.01 
0.28 
0.80 
0.06 
0.17 
0.10 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 



CI 
C3 
C4 
C7 
C8 



TTB 
TTB 
TTB 
TTB 
TTB 



C12 TTB 
C2 TTB 



C5 
C9 
C13 
C 

F2 
F3 



TTB 
TTB 
TTB 
TTB 
TTB 
TTB 



5 

5 

5 

5 

5 

5 
5 
5 
5 
5 
5 
5 
5 



4. 


526 


31. 


683 


5. 


097 


32. 


667 


5. 


268 


31. 


104 


6. 


410 


33, 


080 


6. 


579 


31. 


509 


7 


152 


32, 


494 


3 


.140 


31.254 


2 


.845 


29 


.923 


1 


.526 


29 


.534 


0 


.504 


30 


.475 


-0 


.918 


30 


.059 


-1 


.743 


30 


.598 


-1 


.306 


30 


.543 



76.605 
75.804 
77.625 
76.022 
77.844 
77.050 
76.392 
76.117 
75.908 
75.970 
75.738 
76.705 
74.507 



-0.00 

-0,00 

-0.00 

-0.00 

-0.00 

-0.00 

-0.00 

-0.00 

-0.00 

-0.00^ 

-0.00 

-0.00 

-0.00 



0.33 

0.30 

0.41 

0.31 

0.56 

0.48 

0.29 

0.35 

0.32 

0.26 

0.19 

0.04 

0.17 
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ATOM 14 F4 TTB 

ATOM 15 CIO TTB 

ATOM 16 C6 TTB 

ATOM 17 Cll TTB 

ATOM 18 015 TTB 

ATOM 19 N14 TTB 

ATOM 20 017 TTB 

ATOM 21 C18 TTB 

ATOM 22 C20 TTB 

ATOM 23 C23 TTB 

ATOM 24 C26 TTB 

ATOM 25 C28 TTB 

ATOM 26 C29 TTB 

ATOM 27 C30 TTB 

ATOM 28 C32 TTB 



ATOM 


29 


C34 


TTB 


ATOM 


30 


F36 


TTB 


ATOM 


31 


C33 


TTB 


ATOM 


32 


C31 


TTB 


ATOM 


33 


C27 


TTB 


ATOM 


34 


C24 


TTB 


ATOM 


35 


C19 


TTB 


ATOM 


36 


022 


TTB 


ATOM 


37 


G21 


TTB 


ATOM 


38 


C25 


TTB 


ATOM 


39 


016 


TTB 


ATOM 


40 


HI 4 


TTB 


ATOM 


41 


HI 6 


TTB 



END 



-1.004 28.684 



5 0.798 31.805 

5 2.115 32.193 

5 6.976 34.145 

5 7.833 33.843 

5 6.540 35,476 

5 7.023 36.473 

5 7.909 37.429 

5 9.314 36.895 

5 10.230 36.867 

5 11.523 36.403 

5 11.908 35.965 



13.268 35.476 

13.473 34.177 

14.762 33.707 

15.852 34.538 

17.098 34.083 

15.650 35.837 

14.357 36.308 

11.000 35.995 

9.707 36.461 

■ 5.817 37.174 



5 5.068 37.798 

5 5.540 37.104 

5 4.369 37.860 

5 8.474 32.873 

5 5.863 35.730 

5 8.626 32.887 



PCTAJS2004/041329 

75.686 -0.00 0.23 

76.246 -0.00 0.14 

76.456 -0.00 0.17 

75.150 -0.00 0.29 

74^311 -0.00 0.18 

75,264 -0.00 0.11 

74.380 -0.00 0.14 

75.225 -0.00 0.23 
75.489 -0.00 0.35 
74.443 -0:00 0.24 
74.656 -0.00 0.30 
75.918 -0.00 0.45 
76.134 -0.00 0.47 
76.583 -0.00 0.49 
76.790 -0.00 0,61 
76.551 -0.00 0.57 
76.755 -0.00 0.57 
76.103 -0.00 0.46 
75.893 -0.00 0.40 
76.969 -0.00 0.41 
76.754 -0.00 0.36 
73.834 -0.00 0.06 
74.596 0.00 0.21 
72.475 0.00 0.22 
72.178 -0.00 0.01 
77.308 -0.00 0.21 
75.997 0.00 0.11 
78.289 -0.00 0.09 
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Tables 



Atomic Interactions Between I7L and TTP-A 



1 — • :: — r: — i " — 

TTP-A Atom 


i r 

I7L Atom 


TliQfance f^AnBstoms") 


Interaction type 


02 


Cys328 - bo 


^ 18 

J, xo 


Charge 


03 


Gly32y - IN 


3.62 


Charge 


CIO 


Irploo - ^0 


3.34 


Hydrophobic 


08 


Asnl71-N 


2.86 


WvHroeren bond 


C12 


Metl95 - SG 


3.74 


Hydrophobic 


025 


Asnl99-ODl 


3.26 


Hydrogen bond 


H17 


Phe236'-0 


1.86 


Hydrogen bond 


C24 


Ile203 CDl 


4.79 


Hydrophobic 


C14 


Metl95-SG 


3.33 


Hydrophobic 


02 


Ser240 - OG . 


3.76 


Hydrogen bond 



1- Designations for atoms on TTP-A are as shown in FIG, 6. 

2- Designations for atoms in I7L are as shown in Table 2 



Table6 



Atomic Interactions Between I7L and TTP-B 



TTP-BAtom* 1 I7LAtom^ 


Distance (Angstoms) 


Interaction type 


021 


Cys328 - SG 


3.07 


Electrostatic 


C25 


Metl69 - CB 


3.70 


Hydrophobic 


022 


His241-N 


3.08 


Hydrogen bond 


015 


Asnl7l-N 


2.87 


Hydrogen bond 


H14 


Leu239 -O 


2.20 


Hydrogen bond 


016 


Asnl71-ND2 


3.50 


Hydrogen bond 


C26 


Leu239 - CA 


3.06 


Hydrophobic 


C27 


Metl95 - SD 


3.24 


Hydrophobic 


C9 


Trpl68-CE3 


3.12 


Hydrophobic 


C5 


Leii239-CD1 


3,97' ' 


Hydrophobic 



1- Designations for atoms on TTP-B are as shown in.FIGt,6% r 

2- Designations for atoms in 17L are as shown in Table 2 
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